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WO 00/61813 PCT/USOO/09558 
5 A SECOND MAMMALIAN TANKYRASE 

RELATED APPLICATIONS 

5 

10 This application claims the priority basis of U.S. Provisional Patent Applications 60/128,577. filed April 

9, 1999; and 60/129,123. filed April 13, 1999. For purposes of prosecution in the U.S.A.. the priority documents 
are hereby incorporated herein by reference in their entirety. 

10 TECHNICAL FIELD 

15 

This invention relates generally to the field of molecular biology of telomere and telomere associated 
proteins, and the maintenance of telomere structure. More specifically, this invention relates to a novel protein 
that shares three domains of homology with the telomerase associated protein Tankyrase I. 



15 

20 



BACKGROUND 



Recent research has described what may be a key switch in the control of cellular aging. The 
telomeres at chromosome ends are made up of multiple repeats of the DNA sequence TTAGGG, which are 
25 20 thought to stabilize the chromosome during replication. Telomeres shorten each time the cell divides, and cells 

become senescent when the telomeres are too short to protect the chromosome. But in some cells, including 
embryonic cells, an enzyme called telomerase rebuilds the telomeres after each division, extending the 
implicative capacity of the cell. (Bodnar et al.. Science 279:349, 1998; Hariey et at., Curr. Opin. Genet. Dev. 
5:249. 1995). 

30 25 Regulation of telomerase activity is a complex process Involving several protein components. Two 

such proteins have DNA binding activity, and are named tetomeric repeat binding factors (TRF1 and TRF2). It 
is thought that TRF1 is involved in regulating telomere length, because overexpression of wild-type TRF1 
makes telomeres shorter, while overexpression of a dominant-negative form of TRF1 makes telomeres longer 
— perhaps by affecting the access of telomerase to the chromosome terminus (van Steensel et al.. Nature, 

35 20 335:740, 1997). TRF1 promotes parallel pairing of telomeric tracts, apparently pairing in parallel homodimers 

that form filamentous structures on longer telomeric repeat arrays (Griffith et al.. J. MoL Biol. 278:79-88, 1998). 

The role of TRF2 appears to be protection of the chromosome terminus, since expression of a 
dominant-negative form of TRF2 leads to chromosome-chromosome fusions. (Griffith et al., J Mo! Biol. 278:79, 
1998: Broccoli et al., Nature Genetics 17:231, 1997: van Steensel et al., Cell 92:401, 1998). This in turn leads 

40 

35 to p53- and ATM-dependent apoptosis of the cell (Kartseder et al., Science 283:1321 , 1 999). TRF1 and TRF2 
have been implicated in large duplex loops at the end of telomeres that may provide a general mechanism for 
telomere protection and replication (Griffith et al.. Cell 97:503, 1999). 

Smith et al. (Science 282:1484. 1998; Genomics 57:320, 1999; J. Cell Sci. 112:3649. 1999) have 
45 reported a novel protein that associates with TRF1 , which they named "Tankyrase". A yeast two-hybrid screen 

40 was used with human TRF1 as bait, and yielded two overlapping cDNAs which provided the full-length 
sequence. Northern blot analysis revealed that multiple mRNAs were ubiquitously expressed in human tissues, 
with the highest amounts detectable in testes. It has been proposed that tankyrase interferes with the binding 
of TRF1 to telomeres, which In turn has an effect on telomere length. Tankyrase co-localizes with TRF1 at the 
50 ends of human chromosomes in metaphase and interphase, and also resides at nuclear pore complexes and 
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centrosomes. Smith et al. reported that the gene for tankyrase Is positioned at 17.6 cRioooo on human 
chromosome 8 with a LOD of 8.2 on the G3 map. 

The molecular events involved In managing chromosome structure and regulating ceO senescence are 
extremely complex. Each new protein found to participate in this process provides new opportunities for 
monitoring and intervening in some of the fundamental events of cell biotogy. 

SUMMARY OF THE INVENTION 



This Invention provides a new human protein which is hereby designated Tankyrase II. This new 
10 protein shares three domains with the Tankyrase protein of Smith et al.: the ANK domain comprising 24 
15 repeats of the ankyrin motif, the SAM domain thought to be Involved in protein-protein interaction, and the 

PARP domain that is responsible for the poly(ADP-ribose) polymerase activity. Tankyrase II further comprises 
has a new domain at the N-terminal, designated the GC domain, which has no known homologs. 

One of the embodiments of this invention is an isolated polynucleotide having at least about 
15 30 consecutive nucleotides contained in a human Tankyrase II encoding sequence, or that is contained in 
plasmids deposited under Accession No. 203919. or that hybridizes under stringent conditions to a Tankyrase II 
encoding sequence, but does not consist of the encoding sequence for human Tankyrase I or other previously 
known structurally related proteins, such as those having PARP activity. Another embodiment of this invention 
is an isolated polynucleotide having at least 100 consecutive nucleotides that Is at least 90% Identical to a 
25 20 Tankyrase II sequence, or contained In the deposited plasmids, but not in X-phage, Tankyrase I. or other 

previously known sequences. Certain polynucleotides of this invention encode a protein comprising a GC 
domain, a PARP domain, a SAM domain, or an ANK domain, or a protein that binds other telome re-associated 
proteins like TRF1, TRF2, TIN2. and Tankyrase I. or that ADP-ribosytates a target protein in the presence of 
NAD*. Polynucleotides of this invention can be used to obtain the encoded polypeptide, or to determine other 
30 25 polynucleotides that encode Tankyrase ll-like protein. 

Another embodiment of this invention is an isolated polypeptide comprising a sequence of at least 10 
consecutive amino acids that is contained in Tankyrase II, or is contained in the deposited plasmids, but is not 
contained In any previously known peptide sequence. Another embodiment of this invention is an isolated 
polypeptide comprising a sequence of at least 25 consecutive amino acids that is at least 90% identical to a 
35 30 Tankyrase II protein sequence, or a protein sequence encoded in the deposited plasmids. Certain polypeptides 

of this invention comprise a GC domain, a PARP domain, a SAM domain, or an ANK domain, or have activity 
for binding other telo mere-associated proteins like TRF1, TRF2, TIN2, and Tankyrase I, or ADP-ribosylate a 
target protein in the presence of NAD*. 

A further embodiment of this invention is an isolated human Tankyrase II protein or fragment thereof. 

40 

35 at least 1 0-fold higher in purity (or more) on a weight per weight basis than what occurs in natural sources. 

Also embodied in this invention are polynucleotides encoding the polypeptides of this invention, and 
antibodies of any sort that bind specifically to the polypeptides of this invention. Some of the antibodies inhibit 
the catalytic activity of Tankyrase II; inhibit the binding of Tankyrase II to other telomere associated protein; or 
45 inhibit protein ribosylation mediated by Tankyrase II. Peptides can be obtained by expressing a polynucleotide 

40 of the invention in a suitable host cell. Also provided are means for obtaining any antibody of this invention, 
comprising immunizing an animal or contacting an immunocompetent particle with a polypeptide of this 
invention. Peptides of this invention can be isolated from a mixture by using an antibody as a specific 
adsorbant; conversely, antibodies of this invention can be isolated using a peptide epitope as a specific 
50 adsorbant. 
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5 A further embodiment of this Invention is a method for ribosylating a target protein, comprising 

incubating the target protein with a peptide of this Invention in the presence of NAD*. 

Assay methods of this invention include determining Tankyrase II binding activity by incubating with a 
peptide of this invention under conditions where the protein can bind the peptide specifically to form a complex, 
5 and then correlating any complex formed with the presence or amount of the protein In the sample. The 
70 protein that has Tankyrase II binding activity can optionally be TRF1, TRF2, TIN2. or Tankyrase I. 

Another assay method of this invention is for screening a test compound to determine an ability to 
affect Tankyrase II activity, comprising incubating the compound with containing a peptide of this invention and 
a conjugate binding ligand, and determining any effect of the test compound on complex formation. Another 
10 such method comprises incubating a test compound with a peptide of this invention, a potential target protein, 
15 am | NAD*; then determining any effect of the test compound on the amount or rate of ribosytation of the target 

This invention also Includes a method for modulating Tankyrase II expression in a cell, comprising contacting 
the cell with the polynucleotide of this invention such as an antisense polynucleotide, a ribozyme. or an 
inhibitory RNA under conditions where the polynucleotide can interfere with mRNA translation. Modulating 
1 5 Tankyrase II expression in turn is believed to modulate telomere length m the cell. 
20 These and other embodiments of the invention win be apparent from the description that follows. 

BRIEF DESCRIPTION OF THE FIGURES 

25 20 Figure 1 Is a schematic depiction of Tankyrase II protein. The domains depicted are the GC domain 

(encoded by a gene segment rich In GC), the ANK domain, containing contains 24 ankyrin repeats thought to 
be involved in protein-protein interaction, the sterile alpha motif (SAM) domain, thought to be involved in ceOular 
signaling, and the poty (ADP)-ribose polymerase (PARP) domain, with enzymatic activity for ribosylating target 
proteins such as TRF1. 

33 25 Figures 2, 3, and 4 are sequence listings showing cDNA and amino acid sequence data for human 

Tankyrase II (SEQ. ID NOs: 1 to 6). The data from Figures 2-3 were obtained as described in Examples 1-4; 

the data from Figure 4 were obtained as described in Examples 6-7. 

Figure 5 is a sequence listing comparing Tankyrase II (SEQ. ID NO:6) with its closest known 

intraspecies homolog. Tankyrase I (SEQ. ID NOs:8), at the protein level. 
35 3Q Figure 6 is a sequence listing comparing Tankyrase II (SEQ. ID NO:5) with Tankyrase I (SEQ. ID 

NO:7), at the cDNA level. 

DETAILED DESCRIPTION OF THE INVENTION 

40 35 This disclosure describes the newly discovered protein Tankyrase II. Polynucleotides, polypeptides, 

and antibodies related to Tankyrase II are provided and exemplified. The protein has enzymatic activity that 
causes ribosylation of proximal target proteins using NAD as substrate. Tankyrase II is thought to have binding 
activity for other telomere-associated proteins, which could become ribosylated targets of the enzyme. This in 
turn could play a role in the regulation of telomere length, thereby affecting the replicative capacity of the cell. 
40 The techniques and materials in this disclosure provide the means to model Tankyrase II activity in vitro, and 
provide a way to monitor and modulate Tankyrase II activity in vivo. Modulation of Tankyrase II activity may be 
used to regulate telomerase activity or telomere length. 

Figure 1 shows the structurally distinct domains of Tankyrase II, which provide different functional 
50 features of tankyrase activity. There Is a unique amino-terminal (GC) domain, followed by an ankyrtn (ANK) 

— 3 — 
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motif domain, a sterile alpha modulo (SAM) domain, and a carboxy-terminal poty(ADP) ribose polymerase 
(PARP) domain. 

The ankyrin (ANK) domain of Tankyrase II contains 24 ankyrin repeats — a motif of about 33 residues 
found in a number of different proteins, and thought to act as modular adapters for heterologous protein-protein 
5 interactions (reviewed by Bennett et a!., J. Biol. Chem. 267:8703, 1992; Bennett et al.. and Michaely, TICB 
2:127, 1992: Bork et al.. Proteins: Structure. Function. & Genetics 17:363, 1993). A correlation has been 
observed between the number of ankyrin repeats and the nature of the protein-protein association. Ankyrin 
family members containing 24 ankyrin repeats bind cytoskeletal proteins such as tubulin and spectrin. 

The sterile alpha motif (SAM) domain of Tankyrase II lies downstream from the ANK domain. SAM 
ID domains are found in signaling proteins such as transcription factors, serine/threonine protein kinases, and 
15 GTPases, (Stapteton et al.. 1999, Nature Struct. Biol. 6:44-9; Thamos et al., 1999. Science 283: 833-36). SAM- 

containlng proteins form hetero- and homo-dimers with other SAM-containing proteins that can regulate cellular 
signaling processes. 

The carboxy-terminus of Tankyrase II is a domain homologous to other proteins with poly (ADP)- 
15 ribose polymerase activity, referred to as the PARP domain. Proteins that contain a PARP domain catalyze the 
20 addition of long branched chains of ADP-ribose to target proteins, using nicotinamide adenine dinucleotlde 

(NAD*) as a substrate (reviewed by Still et al., Genomics 62:533, 1999; de Murcia et at., Trends Biochem. Sd 
19:172, 1994; Lindahl et al.. Trends Biochem. Sci. 20:405. 1995). The first such protein, PARP-1 (Adprtl) 
contains DNA-binding zinc fingers, a nuclear localization sequence, and an automodification domain. PARP-1 
20 binds to nicked DNA, and is thought to play a role in chromosomal damage repair (PA Jeggo. Current Biol. 
8:R49, 1998). PARP-2 (AdprtL2) is a homologous protein with ribosylation activity that also binds damaged 
DNA (Ame et 8t. J. Biol. Chem. 274:17860, 1999). VPARP is a related protein that ribosylates major vault 
protein in the mammalian ribonucleoprotein complex (Kickhoefer, J. CeP Biol. 146:917, 1999). Other members 
of the PARP family include Tankyrase I and AdprtLI (Still et al., supra). 
30 25 The presence of the ANK, SAM and PARP domains in Tankyrase II suggests that Tankyrase II plays a 

role in intercellular or intracellular communication (e.g., signal transduction), possibly in conjunction with 
proteins involved in DNA repair pathways or maintenance of telomeres. Ribosylation can also play an 
important role in how Tankyrase II regulates other proteins involved in telomere management. Ribosylation of 
telomere-associated proteins may result in them leaving the telomere, potentially modulating the activity of 
35 30 telomerase reverse transcriptase, and thereby affecting telomere length and replicative capacity of the cell. 

The SAM. PARP and ANK domains of Tankyrase II are homologous to counterpart domains in the 
Tankyrase t protein. However, the N-terminal domains appear to have no homology. Tankyrase I has a 180- 
resldue HPS domain, so called for the abundance of histidine, proline, and serine residues. In contrast, 
Tankyrase II has a substantially different amino acid composition, encoded by a highly GC-rich gene sequence. 
40 35 This N-terminal domain of Tankyrase II will be referred to in this disclosure as the "Divergent" or "GC" domain. 

There is a relationship between the attainment of a critical telomere length In dividing somatic cells 
and DNA damage, and both processes lead to cell cycle arrest and the activation of gene expression pathways. 
Thus, Tankyrase II may communicate with a subset of the signaling molecules in DNA repair processes to 
Initiate the specific arrest and gene activation pathways of cellular senescence. Notably, Tankyrase I has been 
40 demonstrated to ribosytate both itself and TRF1 (Smith et at., 1998, supra), resulting in a reduction of the ability 
of TRF1 to bind telomeric DNA. The link between telomere structure and DNA repair is supported by the 
observation that p53- and ATM (ataxia telangiectasia mutated) dependent apoptosis is induced by telomeres 
with attenuated TRF2 function (Kartseder et al., 1999. Science 283:1321-1325). 

50 
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The structural features of Tankyrase II indicate that it binds to nuclear and cell proteins, and is 
involved In intercellular or intracellular cell signaling that affect telomere structure and metabolism. 
Compositions and treatments that modulate Tankyrase II expression or function are likely to be of therapeutic 
benefit for cancer, disorders associated with replicative senescence, and other conditions associated with 
perturbations of teJomerase activity or telomere length. 



Definftions 

The term "polynucleotide" as used in this disclosure refers to a polymeric form of nucleotides of any 
length. Included are genes and gene fragments, mRNA, tRNA, rRNA, ribozymes, cDNA. recombinant 
10 polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA and RNA. nucleic acid probes, and 
primers. Also Included are nucleotide analogs, including but not limited to thiol-derivatized nucleosides (U.S. 
Patent 5,578,718). oligonucleotides with modified backbones (U.S. Patent Nos. 5,541,307 and 5.378,825). and 
peptide nucleic acids (U.S. Patent No. 5,786.461). The term polynucleotide, as used in this disclosure, refers 
Interchangeably to double- and single-stranded molecules. Unless otherwise specified or required, any 
15 embodiment of the invention that is a polynucleotide encompasses both a double-stranded form, and each of 
the two complementary single-stranded forms known or predicted to make up the double-stranded form. 

The term "oligonucleotide* is reserved for polynucleotides of no more than 100 bases in length, and 
may or may not be accompanied with an antisense strand, depending on context. Oligonucleotides are often 
used as probes in specific hybridization reactions, or as primers in amplification reactions. 
25 20 When comparison is made between polynucleotides for degree of identity, it is implicitly understood 

that complementary strands are easily generated, and the sense or antisense strand is selected or predicted 
that maximizes the degree of identity between the polynucleotides being compared. Percentage of sequence 
identity is calculated by first aligning the polynucleotide being examined with the reference counterpart, and 
then counting the number of residues shared between the sequences being compared as a percentage of the 
30 25 region under examination. No penalty is Imposed for the presence of insertions or deletions, but insertions or 

deletions are permitted only where cleariy required to readjust the alignment. The percentage is given in terms 
of residues in the sequence being examined that are identical to residues in the comparison or reference 
sequence. Particularly desirable polynucleotide sequences preserve at least one function of the prototype. By 
way of example and depending on context, the function preserved may include an ability to hybridize with a 
35 30 target sequence, the function of a polypeptide it may encode, or (for certain gene targeting vectors) the ability 

to facilitate homologous recombination or gene inactivation. An example of an algorithm suitable for finding 
homologous sequences and determining percent sequence identity is the BLAST algorithm, (Altschul et al.. 
1990, J. Mol. Biol. 215:403, 1990; Karlin et al.. Proc. Natl. Acad. Sci. USA 90:5873, 1993), available through 
the National Center for Biotechnology Information (www.ncbi.nlm.nlh.gov/entrez). 

40 

35 Polynucleotide sequences are said to be in a "non-natural arrangement" when they are joined together 

or interposed with another sequence in an arrangement not found in nature. 

"Hybridization' refers to a reaction in which one or more polynucleotides react to form a complex that 
is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding can 
4 5 occur by Watson-Crick base pairing, Hoogsteen binding, triplex formation, or complexing in any other 

40 sequence-specific manner. A hybridization reaction will, on occasion, be a step in a more extensive process, 
such as part of PCR amplification. Hybridization reactions can be performed under conditions of different 
"stringency". Conditions that increase the stringency of a hybridization reaction are widely known (see e.g., 
Sambrook et al., Infra). Examples of conditions in order of increasing stringency: Incubation temperatures of 
50 25'C. 37°C, 50°C, and 68"C; buffer concentrations of 10 x SSC, 6 x SSC. 1 x SSC. 0.1 x SSC (where SSC is 
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0.15 M NaCl and 15 mM citrate buffer, pH 12) and their equivalent using other buffer systems; formamide 
concentrations of 0%. 25%, 50%. and 75%; incubation times from 5 min to 24 h; 1, 2, or more washing steps; 
wash incubation times of 1, 5, or 15 min; and wash solutions of 6 x SSC, 1 x SSC. 0.1 x SSC. or deiontzed 
water. Typical conditions of high stringency for the binding of a probe of about 100 base pairs and above is a 
5 hybridization reaction at 65*C in 2 x SSC, followed by repeat washes at 0.1 x SSC — or the equivalent 
combination of solvent and temperature conditions for the particular nucleic acids being studied. 

A "hybrid" of polynucleotides, or a 'complex" formed between any two or more components in a 
biochemical reaction (such as antibody and antigen), refers to a duplex or higher-order complex that is 
sufficiently long-lasting to persist between its formation and subsequent detection. 
10 A "control element" or "control sequence" is a nucleotide sequence involved in an Interaction of 

/5 molecules that contributes to the functional regulation of a polynucleotide, including replication, duplication, 

transcription, splicing, translation, or degradation of the polynucleotide. "Opera tively linked" refers to an 
operative relationship between genetic elements, in which the function of one element influences the function of 
another element For example, an expressible encoding sequence may be operatively linked to control 
15 element that permit transcription and translation. 
20 The terms "polypeptide", "peptide" and "protein" are used interchangeably hi this disclosure to refer to 

polymers of amino acids of any length. The polymer may comprise modified amino acids, it may be linear or 
branched, and it may be interrupted by non-amino acids. The terms also encompass an amino acid polymer 
that has been modified naturally or by intervention; for example, disulfide bond formation, glycosylation. 
20 lipidation. acetylation. and/or phosphorylation. 

Percentage of sequence identity is calculated for polypeptides by first aligning the polypeptide being 
examined with the reference counterpart or prototype, and then counting the number of residues shared 
between the sequences being compared as a percentage of the region under examination. No penalty is 
imposed for the presence of insertions or deletions, but Insertions or deletions are permitted only where clearly 
30 25 required to readjust the alignment. The percentage is given in terms of residues in the sequence being 

examined that are identical to residues in the comparison or reference sequence. Where substitutions are 
made, conservative substitutions (in which one amino acid is substituted by another with similar charge, size, 
hydrophobicity, or aromaticrty) are typically better tolerated. Desirable sequences preserves the function of the 
prototype: for example, the enzymatic activity, the binding of specific substrates, and the binding of specific 
35 30 antibody as detectable in a standard competition inhibition immunoassay. In certain embodiments, the identity 

may exist over a region that is at least about 10. 20-25, or 50-100 amino acids in length. 

The term "antibody" as used in this disclosure refers to both polyclonal and monoclonal antibody of 
any species. The ambit of the term deliberately encompasses not only intact immunoglobulin molecules, but 
also such fragments and genetically engineered derivatives of immunoglobulin molecules (including humanized 
40 35 forms) that may be prepared by techniques known in the art. and retaining the binding specificity of the antigen 

binding site. 

An "immunogenic" compound or composition is capable of stimulating production of a specific 
immunological response when administered to a suitable host, usually a mammal. 

An "Isolated" polynucleotide, polypeptide, protein, antibody, or other substance refers to a preparation 

45 

40 of the substance devoid of at least some of the other components that may also be present where the 
substance or a similar substance naturally occurs or Is initially obtained from. Thus, for example, an isolated 
substance may be prepared by using a purification technique to enrich it from a source mixture. Enrichment 
can be measured on an absolute basis, such as weight per volume of solution, or it can be measured in relation 
to a second, potentially Interfering substance present in the source mixture. Enrichments by 2, 10, 100, and 
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1000 fold achieve improved degrees of purification. A substance can also be provided in an isolated state by a 
process of artificial assembly, such as by chemical synthesis or recombinant expression. An "isolated* cell is a 
cell that has been separated from the organism in which it was grown. 

A polynucleotide used in a reaction, such as a probe used in a hybridization reaction or a vector used 
5 in gene targeting is referred to as "specific* or "selective" if it hybridizes or reacts with the intended target more 
frequently, more rapidly, or with greater duration than it does with alternative substances. Similarly, a 
polypeptide is referred to as "specific' or "selective* if it binds an intended target, such as a ligand, hapten, 
substrate, antibody, or other polypeptide more frequently, more rapidly, or with greater duration than it does to 
alternative substances. An antibody is referred to as "specific* or "selective* if it binds via at least one antigen 
10 recognition site to the intended target more frequently, more rapidly, or with greater duration than it does to 
15 alternative substances. 

General Techniques 

Unless otherwise noted, the practice of this invention can be carried out by employing standard 
15 techniques of genetic engineering, protein manipulation, and cell culture. Textbooks that describe standard 
20 laboratory techniques include 'Molecular Cloning: A Laboratory Manuar. 2nd Ed. (Sambrook et a!., 1989); 

'Oligonucleotide Synthesis" (M.J. Gait, ed., 1984); 'Animal Cell Culture' (RX Freshney. ed., 1987); the series 
'Methods in Enzymology" (Academic Press, Inc.); "Gene Transfer Vectors for Mammalian Cells" (J.M. Miller & 
MP. Calos, eds., 1987); 'Current Protocols in Molecular Biology" and "Short Protocols in Molecular Biology, 3rd 
20 Edition' (F.M. Ausubel et al., eds., 1987 & 1995); and "Recombinant DNA Methodology IV (R. Wu ed.. 
Academic Press 1995). Techniques used in raising, purifying and modifying antibodies, and the design and 
execution of immunoassays, are described in Handbook of Experimental Immunology (D.M. Weir & C.C. 
Blackwell. eds.); The Immunoassay Handbook (Stockton Press NY, 1994); and R. Masseyeff, W.H. Albert, and 
N.A. Staines, eds.. Methods of Immunological Analysis (Weinhelm: VCH Vertags GmbH. 1993). 

30 25 

Polynucleotides 

The polynucleotides of this invention include those containing nucleotide sequences which are found 
within the Tankyrase II DNA sequence, shown in SEQ. ID NOs:1, 3, and 5. Further sequence for Tankyrase II 
gene can be obtained by employing standard sequencing techniques known in the art to the phage plasmlds 
35 30 deposited in support of this application. 

Also included in this invention are polynucleotides that are from naturally occurring allelic variants, 
synthetic variants, and homologs of Tankyrase It with a percentage of residues identical to the Tankyrase II 
cDNA or gene sequence, determined as described above. It is understood that substitutions, insertions, and 
deletions can be accommodated within a polynucleotide sequence without departing from the spirit of this 
40 35 invention. In certain embodiments, the polynucleotide sequences are at least about 80% , 90%, 95%. or 98% 

identical to a sequence or part of a sequence exemplified In this disclosure; in order if increasing preference. In 
other embodiments, the polynucleotide sequences are 100% identical to a reference sequence or a fragment 
thereof. The length of consecutive residues in the identical or homologous sequence compared with the 
exemplary sequence can be at least about 15, 30, 50, 75. 100, 200 or 500 residues in order of increasing 
AO preference, up to the length of the entire clone, gene, or sequence. 

This invention includes polynucleotides that are uniquely related to the prototype polynucleotide 
sequences, in comparison with other sequences that may be present In a sample or reaction mixture of interest. 
By way of example, probes of at least about 100 consecutive nucleotides that are at least 90% or 60% identical 
to a reference sequence may be specific, and a probe of at least about 500 or 2000 consecutive nucleotides 
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may be specific if at least 90%, 80%. 70%, or even 60% identical with the reference sequence, depending on 
hybridization conditions, as explained below. On occasion, such nucleotides can be divided into halves (about 
nucleotides 1-2137 and 213B-4275). or quarters (about 1-1068; 1069-2137; 2138-3206; 3207-4275). and still 
retain their specificity. Nucleic acid molecules comprising specified lengths of consecutive nucleotides can be 
5 selected from any of these regions. U wiO also be recognized that for some purposes such as hybridization 
reactions, a specific polynucleotide sequence win readily accommodate deletions from the 5' or 3' end of either 
strand of say, 15, 25, or even 50 nucleotides without compromising function. Internal deletions may also be 
tolerated. 

Of particular interest are poly nucleotides that are distinct from polynucleotides encoding Tankyrase I. 
10 and other proteins containing ANK, PARP, or SAM domains. In certain embodiments of the invention, 
j 5 polynucleotides are distinct from one or more previously known EST sequences, such as those in Gen Bank 

Accession Nos. R64714. AA244138 (SEQ. ID NO:9), AA244137, AA307492, H11865. H17748, N57467, 
R06946, A1247608, R06902, AI247608, H11505, H17635, N29528, AA088990 AJ426537, and AW157349 
(SEQ. ID NO:10), and those listed elsewhere in this disclosure. A polynucleotide of this invention can be 
15 "distinct" from other polynucleotides because of an internal sequence difference (a substitution, deletion, or 
20 Insertion), or because it Is defined to encompass additional sequence at either end. Also included in the 

invention are recombinant or synthetic polynucleotides in which a Tankyrase It-like sequence is linked to a 
heterologous sequence to form: for example, a heterologous promoter in an expression vector, or a selectable 
marker such as neo in a targeting vector. 
20 The polynucleotides of this Invention can be In the form of an expression vector, In which the encoding 

25 sequence is operative^ linked to control elements for transcription and translation in a prokaryotic or eukaryotic 

host cell of interest. A variety of suitable vectors and their design and manufacture are known in the art. Vector 
systems of interest include but are not limited to those based on retroviruses, adenoviruses, adenoassociated 
viruses, herpes viruses, SV40. papilloma virus. Epstein Barr virus, vaccinia virus, lenti virus, and Semliki Forest 
25 virus. 

Particular polynucleotides of this invention are useful for producing polypeptides of interest, as 
nucleotide probes and primers, and as targeting vectors for genetic knockouts. Further description of the 
characteristics of such constructs are provided elsewhere in this disclosure. 

35 30 Preparation 

The polynucleotides of this invention can be prepared by any suitable technique in the art. Using the 
data provided in this disclosure or deduced from the deposited plasmids, sequences of less than -50 base 
pairs are conveniently prepared by chemical synthesis, either through a commercial service or by a known 
synthetic method, such as the triester method or the phosphite method. A suitable method is solid phase 
40 35 synthesis using mono nucleoside phosphoramidite coupling units (Hirose et al., 7etra. Lett. 19:2449-2452, 1978; 

U.S. Patent No. 4,415,732). 

For use in antisense therapy, polynucleotides can be prepared synthetically that are more stable for 
the pharmaceutical preparation for which they are intended. Non-limiting examples include thiol-derivatized 
nucleosides (U.S. Patent 5,578,718), oligonucleotides with modified backbones (U.S. Patent Nos. 5,541,307 
45 40 and 5.378,825). Also of interest in the context of antisense constructs are peptide nucleic acids. Prototype 

PNA have an achiral poly amide backbone consisting of N-(2-amt no ethyl) glycine units, to which purine and 
pyrimidine bases are linked, for example, by way of a methylene carbony) linker. PNAs are nuclease and 
protease resistant, and the uncharged nature of the PNA oligomers enhances the stability of PNA-nucleotide 
duplexes, thereby blocking transcription or translation. Uptake into cells can be enhanced by conjugating to 
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lipophilic groups Incorporating into liposomes, and introducing an amino acid side chain into the PNA 
backbone. See Soomets et a)., Front. Bcosci. 4:D782, 1999; U.S. Patents 5,539.082, 5.766.855, 5,786,461, 
and International Patent Application WO 8/53801 . 

Polynucleotides of this invention can also be obtained by PCR amplification of a template with the 
5 desired sequence. Oligonucleotide primers spanning the desired sequence are annealed to the template, 
10 elongated by a DNA polymerase, and then melted at higher temperature so that the template and elongated 

oligonucleotides dissociate. The cycle Is repeated until the desired amount of amplified polynucleotide is 
obtained (U.S. Patent Nos. 4,883.195 and 4,683.202). Exemplary primers are shown in Table 1. Suitable 
templates include the plasmids deposited in support of this application, and cONA libraries for cells expressing 
10 Tankyrase 11. Encoding sequences, intron sequences, and upstream or downstream sequences for 
15 Tankyrase II can be obtained from a human genomic DNA library. 



TABLE 1: Exemplary primers for amplifying Tankyrase II sequences 



20 





Forward & Reverse Primers 




Function 


UTANKII-32: 
LTANKIl-37: 


5'-TCC AGAG GCTGGTGACC CCTGA-3" 
5'-TTGAACTAACTACTGAAGA -3* 


SEQ. ID NO: 11 
SEQ. ID NO: 12 


Amplifies entire 
ANK domain 


UTANKII-38: 
LTANKII-39: 


5'-CTGTCTTCAGTAGTTAGTTCA-3* 
5'-GTTAC AAAC CTTCTG AATCT-3' 


SEQ. ID NO: 13 
SEQ. ID NO: 14 


Amplifies entire 
SAM domain 


UTANKII-40: 
LTANKII-41: 


5' -GAAAGATAC ACTC ACCGGA-3* 
S'-TAGGGTTCAGTGGGAATTAG-S* 


SEQ. ID NO: 15 
SEQ. ID NO: 16 


Amplifies entire 
PARP domain 


gt11-5': 
gt11-3': 


S-GACTCCTGCAGCCCGTCA-^ 
5'-GGTAGCGACCGGGCGTCA-3' 


SEQ. ID NO: 17 
SEQ. ID NO: 18 


Amplifies X11L-1-1 
cDNA insert 



Production scale amounts of large polynucleotides are most conveniently obtained by inserting the 
15 desired sequence into a suitable cloning vector and reproducing the clone. Techniques for nucleotide cloning 
are given in Sambraok. Fritsch & Maniatis (supra) and in U.S. Patent No. 5.552,524. Exemplary cloning and 
expression methods are illustrated in Examples 1 and 2. below. Polynucleotides can be purified by standard 
35 techniques in nucleic acid chemistry, such as phenol-chloroform extraction, agarose gel electrophoresis, and 

other techniques known in the art. adapted according to the source of the polynucleotide. 

20 

Assessment and use of the polynucleotides 

Polynucleotides of this invention can be used to identify Tankyrase II nucleotide sequences in a 
40 sample of interest for research, diagnostic evaluation, or any other purpose. Generally, this will involve 

preparing a reaction mixture in which a sample suspected of containing an Tankyrase ll-related sequence is 
25 contacted with a polynucleotide of this invention under conditions that permit the polynucleotide to hybridize 
specifically with the compound being tested for, detecting any stable hybrids that form, and correlating the 
hybrids with the presence of a Tankyrase II related sequence in the sample. The formation of stable hybrids 
45 can be detected by any suitable method known in the art. For example, the probe sequence with a detectable 

label such as a radioisotope, a chromophore, or a hapten such as avidin to which an signaling reagent can be 
30 attached. Alternatively, the reagent polynucleotide can be a primer for an amplification reaction in which the 
amount of product produced correlates with the formation of specific hybrids. 

50 
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The specificity of the probe or primer, and the stringency of hybridization conditions are both chosen 
with a view to facilitating detection of sequences of interest, while diminishing false positive reactions. Thus, 
when it is important to distinguish between Tankyrase II sequences from Tankyrase I sequences, particularly 
when using sequence outside the GC domain, then stringency conditions should be high, and the reagent 
5 polynucleotide should be nearly Identical to the sequence being tested for. Conditions can be determined 
empirically so that the reagent polynucleotide wUl hybridize with the Tankyrase II sequence being tested for but 
not with other sequences that might be present in the sample of interest In other instances, assays for 
Tankyrase II are conducted on samples where Tankyrase I is not present, or where it is desirable to test for 
Tankyrase I and Tankyrase II together. In these instances, the capability of the probe to cross-hybridize with 
10 Tankyrase i is not a hindrance, and may provide certain advantages. 
75 Polynucleotides of this invention can also be used to inhibit the transcription or translation of 

Tankyrase II in target cells. Such polynucleotides can be in the form of antisense constructs, which in some 
embodiments binds to Tankyrase II mRNA and prevent translation. Other polynucleotides of this invention are 
ribozymes having a substrate (Tankyrase II mRNA) binding portion, and an enzymatic portion with 
15 endonuclease activity that cleaves the substrate. Design and use of ribozymes is described generally in U.S. 
20 Patent Nos. 4.987,071 . 5.766,942, 5.998,193. and 6.025.167. The modulation of Tankyrase II expression using 

ribozyme constructs is embodied In this invention. 

This invention also includes interfering RNA (RNAi) complexes. The structure and activity of RNAi is 
reviewed by Bosher et al. (Nature Ceil Biol. 2:E31. 2000) and CP. Hunter (Curr. Biol. 9:R440, 1999). The RNAi 
20 complexs of this invention comprise double-stranded RNA comprising Tankyrase II sense and antisense 
^ polynucleotides (optionally in a hairpin configuration) that specifically inhibits translation of mRNA encoding 

Tankyrase II and Tankyrase ll-like proteins. Also contemplated are polynucleotides that bind to duplex 
Tankyrase II sequences to form a triple helix-containing nucleic acid, blocking expression at the transcription 
level (Gee et al.. in Huber and Carr, 1994, Molecular and Immunologic Approaches. Futura Publishing Co.; 
25 Rininsland et al.. 1 997. Proc. Natl. Acad. Sci. USA 94:5854, 1 997). 

This invention also encompasses polynucleotides that encode polypeptides of interest. 
Characteristics of the polypeptides of this invention are described in the section that follows. For polypeptides 
that are fragments of naturally occurring Tankyrase II, there will be a corresponding naturally occurring 
polynucleotide encoding sequence. Those skilled in the art will recognize that because of redundancies in the 
35 30 amino acid code, any polynucleotide that encodes a peptide of interest can be used in a translation system to 

produce the peptide. Except where otherwise required, all possible codon combinations that translate into the 
peptide sequence of interest are included in the scope of the invention. 



30 



Polypeptides 

40 35 The polypeptides of this invention include those that comprise amino acid sequences encoded within 

any of the polynucleotides of this invention, exemplified by SEQ. ID NO 6 and its subfragmenls. Also included 
in this invention are polypeptides containing Tankyrase II like sequence that is from naturally occurring allelic 
variants, synthetic variants, and homologs of Tankyrase II with a percentage of residues identical to the 
Tankyrase II protein, calculated as described elsewhere in this disclosure. 

^ 40 It is understood that substitutions, insertions, and deletions can be accommodated within a protein 

sequence without departing from the spirit of this invention. Conservative substitutions are typically more 
tolerable, such as the substitution of charged amino acids with amino acids having the same charge, or 
substituting aromatic or lipophylic amino acids with others having similar features. Certain peptides of this 
invention are 60%, 80% , 90%, 95%. or 100% Identical to one of the sequences exemplified in this disclosure; 
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in order of Increasing preference. The length of the identical or homologous sequence compared with the 
prototype polypeptide can be about 7, 10, 15. 25, 50 or 100 residues in order of increasing preference, up to 
the length of the entire protein. 

This invention includes polypeptides that are uniquely related to the prototype sequences, in 
5 comparison with other sequences that may be present in a sample or reaction mixture of interest By way of 
example, peptides of at least about 10 consecutive amino acids that are at least 90% or 80% identical to a 
reference sequence, or wherein the rest of the peptide contains only conservative substitutions, may uniquely 
identify the peptide in terms of functional or antigenic characteristics. A peptide of at least about 25. 100. or 
300 consecutive amino adds may be specific ff at least 90%. 70%, 60%. or even 50% identical with the 
10 reference sequence. Longer peptides can be divided into halves (amino acids 1-584 and 585-1 165 of SEQ. ID 
15 NO:6). or quarters (amino acids 1-292; 293-584; 585-876; 877-1166) and still retain one or more of their 

functional activities — such as ribosytation of target proteins, and the binding to conjugate peptides through the 
Tankyrase II ANK and SAM domains. Peptides from the region encoded by nucleotides 1-283 of SEQ. ID NO:5 
are also of interest. II will be recognized that for some purposes such as reactions with antibody or the contact 
15 region on an opposing protein, a specific polypeptide sequence will readily accommodate deletions from the N- 
20 or C- end, say, of 3. 5, or even 10 amino acid residues. 

Certain peptides of this invention are distinct from peptides previously known: These include human 
Tankyrase I proteins, and other proteins comprising PARP, SAM, and ANK domains. In certain embodiments, 
polypeptides of the invention are distinct from predicted amino acid sequences encoded in one or more 
20 previously known polynucleotide sequences, such as those cited at other places in this disclosure. A 
25 polypeptide of this invention can be "distinct" from other polypeptides because of an internal sequence 

difference (a substitution, deletion, or insertion), or because it is defined to encompass additional sequence at 
either end. Also included in the invention are artificially engineered fusion proteins in which a Tankyrase II like 
sequence is linked to a heterologous sequence which modulates Tankyrase II activity, provides a 
25 complementary function, acts as a tag for purposes of labeling or affinity purification, or has any other desirable 
purpose. 



30 



Preparation 

Short polypeptides of this invention can be prepared by solid-phase chemical synthesis. The 
2$ 30 principles of solid phase chemical synthesis can be found in Dugas & Penney, Bioorganic Chemistry. Springer- 

Verlag NY pp 54-92 (1981). and U.S. Patent No. 4,493,795. Automated solid-phase peptide synthesis can be 
performed using devices such as a PE-Applied Blosy stems 430A peptide synthesizer (commerciaiiy available 
from Applied Biosystems. Foster City CA). 

Longer polypeptides are conveniently obtained by translation in an in vitro translation system, or by 
40 35 expression in a suitable host cell. To produce an expression vector, a polynucleotide encoding the desired 

polypeptide is operably linked to control elements for transcription and translation, and then transfected into a 
suitable host cell. Expression may be effected in prokaryotes such as E. coli, eukaryotic microorganisms such 
as the yeast Saccharomyces cerevisiae, or higher eukaryotes, such as insect or mammalian cells. Control 
elements such as the promoter are chosen to permit translation at an acceptable rate under desired conditions. 
45 40 A number of expression systems suitable for producing the peptides of this invention are described in U.S. 

Patent No. 5,552.524. Expression cloning Is available from such commercial services as Lark Technologies, 
Houston TX. 

Following production, the protein is typically purified from the producing host cell by standard methods 
in protein chemistry in an appropriate combination, which may include ion exchange chromatography, affinity 
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chromatography, and HPLC. Expression products are optionally produced with a sequence tag to facilitate 
affinity purification, which can subsequently be removed by proteolytic cleavage. 

Also contemplated are Tankyrase II protein isolated from human biological samples, including tissue 
samples and cultured ceil lines, tracking activity on the basis of functional assays and/or immunoassays 
5 provided below. Antibody to Tankyrase II described In the following section can be used in immunoaffinity or 
immunopreciprtation techniques to enrich Tankyrase tl from biological samples. If desired, fragments can be 
made from whole Tankyrase II by chemical cleavage (e.g.. using CNBr). or enzymatic cleavage (using trypsin, 
pepsin, dispase. V8 protease, or any other suitable endopeptldase or exopeptidase). Enrichment of peptides 
and proteins of this invention from natural or synthetic sources provides a purity of 10-fold. 100-fold, 1000-fold. 
10 or 10,000-fold higher than what is found in nature, in terms of a weight to weight ratio of Tankyrase II peptide to 
1 5 other proteins in the sample mass. 

Assessment and use of the polypeptides 

Polypeptides of this invention can be used for a number of purposes, including but not limited to the 
15 characterization of telomerase function and how it is regulated, assays for proteins and nucleotide sequences 
20 to which Tankyrase II binds, the identification of new proteins with Tankyrase II binding activity that may play a 

role in maintaining telomere length, replicative capacity, apoptosis, chromosome packing, or gene expression, 
and the obtaining of antibody specific for Tankyrase II. 

Subregions of Tankyrase II and homologs can be assessed for function based on the known domain 
20 structure of Tankyrase II, and employed according to the role they play in the activity of the whole molecule. 

A putative PARP domain in a Tankyrase II homolog can be identified on the basis of sequence 
similarity, since a high degree of conservation with Tankyrase II PARP and other proteins with PARP domains 
(e.g., Tankyrase I, PARP-1 and PARP-2), especially over critical conserved residues, correlates with 
ribosylation activity. Residues of Tankyrase II thought to play an important role in the enzymatic activity are 
25 shown in Table 2. Functional assays for pory(ADP-ribose) polymerase can be conducted by incubating the 
putative PARP-contalning peptide with a target protein (such as the Tankyrase II ANK and domains), or TRF1. 
in the presence of nicotinamide adenine dinudeotide (NAD*), or an analog labeled with a radioisotope such as 
^P or M P, biotin. or a fluorescent group. ADP-ribosylation can be monitored by incorporation of the label into 
the protein phase, by a change of size of the target protein (measurable, for example on a protein gel), or by 
35 30 detection of ADP ribose polymers on the target (for example, using commercially available antibody specific for 

ADP-ribose polymers, by digestion with glyco hydrolases, or by physical-chemical mechanisms, such as mass 
spectrometry). Known PARP inhibitors like 3 amino-benzamide (3AB) can be used to verify the specificity of 
the assay. A number of other assays for rapid detection of poty(ADP«ribose) polymerase activity have been 
described. See Sallmann et al., Mol. Cell Blochem. 1B5:199, 1998; Simonin et al.. Anal. Biochem. 195:226. 
40 35 tggt; shah et al.. Anal. Biochem 232:251. 1995. Peptides with confirmed PARP activity can then be used as a 

reagent to ADP-ribosylate protein targets of interest 

A putative SAM domain can also be identified on the basis of sequence similarity with the sterile alpha 
motif domain in other proteins (Tankyrase ll, Tankyrase I. EphB2 receptor, and others reviewed by Stapleton et 
al.. Nature Struct. Biol. 6:44, 1999). The SAM domain has been implicated in forming homodimers and 
45 40 heterodimers with other SAM-containing proteins (Stapleton et al.. op. cit.; and Kyba et al., Dev. Genet. 22:74, 

1998; Thanos et al., Science 283:833. 1999). Thus, putative SAM domains can be screened functionally in 
dimerization reactions: either with themselves, or with SAM domains from other proteins with known 
heterodimertzatton activity. Dimerization can be detected in an equilibrium system (e.g.. using a biosensor), or 
in a separation system (e.g., by gel filtration chromatography or in a gel shift experiment). Dimerization can 
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also be detected in a reporter gene assembly — for example, where a conjugate binding site on another protein 
is fused to a DNA-binding peptide, and the putative SAM domain is fused to a trans-activator. These constructs 
are then transfected into a ceil comprising a reporter gene (such as Lac Z). which signals proximity of the trans- 
activator, indicating binding between the two peptides. Peptides with confirmed SAM activity according to any 
5 of these assays can then be used in turn as reagents in a dimerization assay to detect or quantitate 
Tankyrase II or other proteins with a SAM domain They can also be used to inhibit Tankyrase II activity by 
competition at the SAM binding site. 

Putative ANK and GC domains can also be identified on the basis of sequence similarity to 
corresponding domains in Tankyrase II. In addition, ANK domains characteristically have a number of tandem 
10 repeats about 33 amino acids long. Dozens of proteins containing anywhere from one to dozens of ANK 
15 repeats are known. Michaely et al. (Trends Cell Bio!. 2:127, 1992) report the consensus sequence as 

-XGXTPLHLAARXGHVEWKLLLDXGADVNAXTK- 
A I SQ NNLDIAEV K NPD D 
15 VKTMRQSIN 

E SEQ. ID NO: 19 

20 

ANK repeats generally are implicated in protein-protein binding, and the ANK domain in Tankyrase I is 
20 responsible for the binding of TRF1. Tankyrase II is believed to have binding activity for several proteins 
invoked in telomere regulation and other aspects of chromosome management. Such proteins include TRF1. 
TRF2, TIN2, and Tankyrase I, which are all proteins known to interact in the management of telomeres. 
2 5 Peptide fragments and homologs of Tankyrase II can be tested for binding to such proteins, and those showing 

activity can in turn be used to assay TRF1 , TRF2, TJN2, or Tankyrase I. The general format of such an assay 
25 comprises incubating a sample suspected of containing the protein with Tankyrase II binding activity with a 
peptide of this invention under conditions where the protein can bind the peptide to form a complex, and 
correlating any complex formed with the presence or amount of Tankyrase (I binding activity in the sample. In a 
30 similar fashion, fragments and homologs of Tankyrase II can be tested for binding to polynucleotides having 

particular sequences, such as the tandem repeats that are characteristic of telomeres. Since Tankyrase II binds 
30 and ribosylates telomere- associated proteins, fragments and homologs of Tankyrase II can also be tested for 
modulation of telomere length. Cells are transfected with an expression vector for the fragment or homolog, 
and the effect on telomere length is measured by a suitable method, such as the assays described in U.S. 
35 Patent Nos. 5,707,795, 5,741 .677, and 5,834.193. 

A systematic approach can be used to determine functional regions and homologs of Tankyrase II 
35 according to any of these assays. For example, the viability of an assay system is confirmed on the intact 
Tankyrase II protein; then a series of nested fragments is tested to determine the minimum fragment that 
provides the same activity. Similarly, amino acid substitutions can be introduced into the sequence until the 
activity is ablated, thereby determining what residues are critical for functional activity. 

Peptides of this invention can also be used for the preparation and testing of antibodies against 
40 Tankyrase II. for the testing of other compounds for Tankyrase II binding activity, and for the screening of 
potential Tankyrase II modulators. These procedures are detailed further on in this disclosure. 
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Dominant Negative Mutants 

Based on the sequence data provided in this disclosure, someone skilled in the art will be able to 
45 develop dominant negative polypeptide mutants of Tankyrase II, and polynucleotides that encode them. These 
mutants may be used to inhibit the function of Tankyrase II in a cell or reaction mixture. The production of 
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dominant negative mutants entails deleting or mutating an important functional element of the native 
Tankyrase II. For example, functional mutation or deletion of the ANK domain may produce peptides that do 
not bind to TRF1 or TRF2, but retain SAM binding activity. Conversely, a functional mutation or deletion of the 
SAM domain may produce peptides are deficient in binding to proteins such as TRF1 or TRF2, but still have the 
5 ribosylation activity of PARP. A functional mutation or deletion of the PARP domain (for example, mutation of 
all or a subset of the residues from the Tankyrase II C-terminus to alanine), may result in a peptide that binds 
Tankyrase II associated proteins, but does not have any ribosylation activity. 

Muteins with point mutations can also be obtained. Specifically, amino acids thought to be critical for 
the activity of the domain could be changed to a neutral amino acid such as alanine, and then reassayed for 
10 functional activity. For Tankyrase II, mutations that may abolish ribosylation activity are changes to His 
1 5 (position 1031), Gly (1032). Gly (1058), Tyr (1060). Tyr (1071), and GIu (1138). 



TABLE 2: Critical Residues for PARP Activity in Tankyrase II 



10 



20 



25 



ERYTHRRKEV 


SEENHNHANE 


RMLFHGSPFV 


NAIIHKGFDE 


RHAYIGGMFG 


Agl^FAENSS 


KSNQ*VYGIG 


GGTGCPVHKD 


RSCYICHRQL 


LFCRVTLGKS 


FLQFSAMKMA 


HSPPGHHSVT 


GRPSVNGIAL 


AgYVIYRGEQ 


AYPEYLITYQ 


IMRPEGMVDG 






SEQ. ID NO:20 



1 5 Screening for other Tankyrase U binding proteins 

Those skilled in the art will readily appreciate that the assays described earlier in this section can be 
30 adapted to screen for other proteins that may be involved in telomere regulation, cell proliferative capacity, 

senescence, and apoptosis. The Tankyrase II domains ANK and SAM have characteristic features of protein 
binding molecules, and can be used to identify binding partners for Tankyrase II by incubating with a candidate 
20 compound under conditions suitable for binding, typically a physiological isotonic buffer containing any 
necessary cofactors that may promote transmolecular interaction. The formation of binding complexes with a 
candidate binding partner that is demonstrably specific (by virtue of being higher affinity than the binding of 
other candidate compounds) correlates with binding activity for Tankyrase II. Positive controls include peptides 
or proteins which have binding activity for Tankyrase II, . while negative controls include ubiquitous and generally 
25 unreactive compounds, such as albumin. Candidates likely to screen positive for Tankyrase II binding include 
fragments and homologs related to telomere-associated proteins such as TRF1. 

This type of conjugate binding assay can be conducted in several different formats. For example, 
Tankyrase II containing protein complexes can be isolated from human tissues or cell lines, for example, by 
tracking Tankyrase II through standard protein purification regimens by way of ribosylation activity or 
30 Tankyrase II antibody binding. In a similar approach, natural sources of Tankyrase II are solubilized in a 
45 suitable buffer, and Tankyrase II complexes are immunoprecipitated. The conjugate binding partner is then 

recovered from the complex, and characterized by physical and chemical criteria (such as apparent molecular 
weight determined by SDS gel electrophoresis), amino acid sequencing, or binding assays with Tankyrase II 
domains. In another example. Tankyrase II is labeled with a traceable substituent. such as biotin. a fluorescent 
35 group, an enzyme, a radioisotope, or a peptide group (e.g., FLG, HA, myc, or an immunoreactive peptide 
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sequence), and then combined in a reaction solution with an isolated candidate binding partner, or with a 
mixture of components (such as a cell extract) In which compounds with Tankyrase II binding activity may be 
found. Formation of complexes with the labeled Tankyrase H is then detected (for example, by gel shift 
techniques or immunoprecipitation). and correlated with binding activity for Tankyrase II. 
5 Another format is a co express ion system, using Tankyrase It, a Tankyrase II fragment, or a 

Tankyrase II homolog as bait For example, a yeast two-hybrid screen system is employed, in which a 
Tankyrase II encoding sequence is fused to one part of the expression system, and a library of candidate 
binding partners Is fused to the complementary component needed for expression of the marker. Cloned cells 
that express the activity of the marker contain an insert that comprises the encoding sequence for a 
10 Tankyrase II binding partner. Yeast two-hybrid screen systems are described generally in Bianchi et al., EMBO 
75 J., 16:1785. 1997. Reagents and suitable libraries (e.g., human fetal liver cDNA transformants) are 

commercially available from Clontech, Palo Alto CA. 

Antibodies 

15 Antibody molecules of this invention include those that are specific for any novel peptide 

20 encompassed in this disclosure. These antibodies are useful for a number of purposes, including assaying for 

the expression of Tankyrase-ll, and purification of Tankyrase-ll peptides by affinity purification. 

Polyclonal antibodies can be prepared by injecting a vertebrate with a polypeptide of this invention In 
an immunogenic form. If needed, immunogenicity of a polypeptide can be enhanced by linking to a earner such 
20 as KLH, or combining with an adjuvant, such as Freund's adjuvant. Typically, a priming injection is followed by 
a booster injection is after about 4 weeks, and antiserum is harvested a week later. If desired, the specific 
antibody activity can be further purified by a combination of techniques, which may include Protein-A 
chromatography, ammonium sulfate precipitation, ion exchange chromatography, HPLC, and immunoafflnity 
chromatography using the immunizing polypeptide coupled to a solid support. Antibody fragments and other 
25 derivatives can be prepared by standard immunochemical methods, such as subjecting the antibody to 
cleavage with enzymes such as papain, pepsin, or trypsin. 

Any unwanted cross-reactivity can be removed by treating the polyclonal antibody mixture with 
adsorbants made of those antigens attached to a solid phase, and collecting the unbound fraction. 
Contaminating activity against other proteins containing ANK, SAM. or PARP domains, or against Tankyrase I, 
35 30 or against Tankyrase-ll from other species, can all be removed by adsorption if such cross-reactivity would 

interfere with the intended use of the antibody. Specificity of the original antisera can be improved to start with, 
by immunizing with peptide fragments of Tankyrase-ll that are substantially distinct from the equivalent region 
of the homologous protein. Alternatively, antibodies that cross-react with Tankyrase l can be enriched by 
immunizing with peptide sequences that are shared between the two proteins. This is illustrated in Example 1 1 . 
40 35 Production of monoclonal antibodies is described in such standard references as Harrow & Lane 

(1988), U.S. Patent Nos. 4,491,632, 4,472,500 and 4,444,887. and Methods in Enzymology 73B:3 (1981). 
Briefly, a mammal is immunized as described above, and antibody-producing cells (usually splenocytes) are 
harvested. Cells are immortalized, for example, by fusion with a non-producing myeloma, transfecting with 
Epstein Barr Virus, or transforming with oncogenic DNA. The treated cells are cloned and cultured, and the 
45 40 clones are selected that produce antibody of the desired specificity. 

Other methods of obtaining specific antibody molecules (optimally in the form of single-chain variable 
regions) involve contacting a library of immunocompetent cells or viral particles with the target antigen, and 
growing out positively selected clones. Immunocompetent phage can be constructed to express 
immunoglobulin variable region segments on their surface. See Marks et al.. New Eng. J. Med. 335:730, 1996, 
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5 International Patent Applications WO 94/13804, WO 92701047, WO 90/02809, and McGuiness et at.. Nature 

Biotechnol. 14:1449, 1998. 

Antibodies can be raised that distinguish between Tankyrase II and Tankyrase I by selecting an 
immunogenic peptide from a region unshared by the ANK, SAM, or PARP domains of Tankyrase I, or other 
5 proteins having one of these domains. Suitable subregions of Tankyrase U are shown in Table 3. 

10 



TABLE 3: tmmunogen Sequences for Tankyrase II Specific Antibody 





Amino Acid Sequence 


Location 


SEQ. ID NO: 


15 


M S G R R C AG GGAAC AS AAAEA VE 


Beyond N-terminal of ANK 
domain (GC region) 


21 




TAAMPPSALPSCYKPQVLNGVRSPG 
ATADALSSGPSSPSSLSAASSLDNLS 
GSFSELSSWSSSGTEGASSLEKKEV 
PGVDFSITQFVRN 


Sequence between ANK 
and SAM domains 


22 


20 


RPEGMVDG 


Beyond C-terminal of PARP 
domain 


23 



Antibody molecules in a polyclonal antiserum against intact Tankyrase II can be screened to map 
Immunogenic portions of the amino acid sequence. Sequential peptides about 12 residues long are 
10 synthesized that cover the entire protein (SEQ. ID NO:6). and overlapping by about 8 residues. The peptides 

25 can be prepared on a nylon membrane support by standard F-Moc chemistry, using a SPOTS™ kit from 

Genosys according to manufacturer's directions. Prepared membranes are overlaid with the antiserum, 
washed, and overlaid with p-galactosidase conjugated anti- immunoglobulin. Positive staining identifies 
antigenic regions, which, in an appropriate context, may themselves be immunogenic. There will also be 

^ 15 antibodies that span different parts of the primary structure, or which rely on a conformational component not 

displayed in smaller peptides. 

The antibodies of this invention can be used in immunoassays to detect or quantitate any of the 
polypeptides of this invention, including the natural form of Tankyrase II present in biological fluid or tissue 
samples. For example, it may be desirable to measure Tankyrase II in a clinical sample to determine whether 

3 5 20 the level of Tankyrase II expression is abnormal, and then correlating the finding with the presence or status of 

a disease associated with increased or decreased Tankyrase II activity or abundance. 

Genera) techniques of immunoassay can be found in 'The Immunoassay Handbook*, Stockton Press 
NY, 1994; and "Methods of Immunological Analysis", Weinhelm: VCH Verlags gesellschaft mbH, 1993). The 
antibody is combined with a test sample under conditions where the antibody will bind specifically to any 

40 25 modulator that might be present, but not any other proteins liable to be in the sample. The complex formed can 

be measured In situ (U.S. Patent Nos. 4,208,479 and 4,708,929), or by physically separating it from unreacted 
reagents (U.S. Patent No. 3,646,346). Separation assays typically involve labeled Tankyrase-ll reagent 
(competition assay), or labeled antibody (sandwich assay) to facilitate detection and quantitation of the 
complex. Assays of this nature can also be used in a competitive format to identify antibodies that bind to the 

45 30 same epitope on a target compound. In one such format, the reference antibody is labeled, and tested for 

binding to Tankyrase II in competition with a test antibody. Antibodies can also be screened to identify those 
with inhibitory capacity for the binding and catalytic activities of Tankyrase 11. 
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5 Modulating Tankvrase II actlvitv 

This invention provides a number of different approaches to modulate Tankyrase il activity in a live 
cell. In one embodiment, the eel is genetically altered using a polynucleotide that affects expression of 
Tankyrase II at the transcription or translation level. Suitable polynucleotides include antisense sequences, 
5 ribozymes. or polynucleotides that form triplexes with the chromosomal gene for Tankyrase II, all of which were 
/0 described in more detail earlier in this disclosure. In another embodiment activity of Tankyrase II within the ceil 

is inhibited by a peptide inside the ceQ that prevents Tankyrase 11 from exercising its usual function. Suitable 
peptides include intracellular antibody constructs that bind to regions of Tankyrase II necessary for catalytic or 
molecular binding activity, and dominant negative homologs that compete for the binding between Tankyrase II 
10 and a Tankyrase II binding partner. Proteins of this nature can be introduced Into a cell by contacting the cell 
15 W fth a polynucleotide expression vector for the intracellular antibody or the mutant homolog. 

Also contemplated are small molecule drugs that have the capability of modulating either Tankyrase II 
catalytic activity, or with its binding to conjugate partners. The ability to inhibit association between 
Tankyrase II and accessory proteins can be determined by introducing candidate inhibitors into any of the 
15 peptide binding assays described earlier, and correlating a decrease in protein complex formation with 
inhibitory capacity of the candidate. 

Compounds can be screened for an ability to modulate ribosylation by preparing a reaction mixture 
comprising the test compound, either Tankyrase II or the Tankyrase II PARP domain (or a functional 
equivalent), the NAD* substrate, and a ribosylation target (Tankyrase II itself, or a Tankyrase II associated 
20 protein). Ribosylation is monitored by incorporation of ^P or 33 P from labeled NAD* substrate into the solid 
phase, by a change of size of the target protein, or by any of the other techniques described earlier. An 
increase in ribosylation of the target correlates with an ability of the compound to enhance Tankyrase II 
ribosylation activity, while a decrease in ribosylation of the target correlates with inhibitory capacity of the test 
compound. The compound can also be screened in one or more parallel assays to determine whether it has 
20 25 the capacity to modulate the ribosylation activity of other enzymes — such as Tankyrase I, and other proteins 

containing PARP domains (reviewed recently by Still et al., Genomics 62:533, 1999). 

Compounds that modulate the activity of Tankyrase II but not other ribosylation enzymes can be 
selected when K is desirable to obtain compounds that are specific for Tankyrase II. These assays can be used 
to screen random combinatorial libraries of small molecule compounds, or as part of rational drug design, 
35 30 based on known PARP inhibitors such as 3-amlnc-benzamide. Other potential Tankyrase II inhibitors include 

4-amino-1,8-naphthalimide (Schlicker et al.. InL J. Radial Biol. 75:91, 1999), thiophenecaroxamides (Shinkwin 
et al.. Bioorg. Med. Chem. 7:297, 1999). and 2«nitroimidazol-5-y1methyl (Bioorg. Med. Chem. Lett. 9:2031. 
1999). 

It is potentially beneficial to modulate Tankyrase II activity in conditions associated with 

40 

35 overexpresslon or underexpression of Tankyrase II. Peptides, expression systems, and small molecule drugs 
can also be screened according to the effect on cell biology. Cells expressing Tankyrase II treated with the test 
system can thereafter be monitored for an effect on telomere length as described earlier, or on replicative 
capacity in proliferation culture. 

45 

40 The following examples are provided as further non-limiting illustrations of particular embodiments of 

the invention. 
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EXAMPLES 
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15 
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Example 1 : Identification o f expressed sequence tao s for Tankyrase II. 

A BLAST search against the GenBank dbEST database using the Tankyrase I sequence identified 
several expressed sequence tags (ESTs). Many of the ESTs were Identical in ONA sequence to the 
Tankyrase I gene. However, several ESTs coded peptides distinct from Tankyrase I. Further evaluation of 
these ESTs revealed they represented a distinct gene, termed Tankyrase il, since the DNA sequence identity to 
Tankyrase I was significantly tower than the amino acid identity, with a preponderance of silent third position 
codon changes. 

The ESTs R64714, AA244138, and AA244137 contain sequences of the ankyrin domain of 
Tankyrase II; the EST AA307492 contain sequences of the SAM domain; ESTs H11865. H17748, N57467, 
R06946, AI247608. and R06902 contain sequences of the PARP domain. Additional 5' sequence of the 3* EST 
AI247608 done revealed it diverges from the tankyrase gene and does not overlap with the SAM domain. It 
may contain an unprocessed intron. The 3* ESTs H11505, H17635, and N29528 were identified in GenBank as 
partners for the 5' ESTs H11865. H17748, and N57467. respectivery. These 9 ESTs along with additional 
sequence obtained from the H11865, H17748. N57467. R06946 clones formed a contig containing the PARP 
domain and - 1 kbp of the 3' UTR, Including a poly-A tail. These ESTs formed 3 contigs that contained 3 of the 
Tankyrase II domains and approximately 40 % of the coding region. 

Additionally, AA08B990 and AI426537 are ESTs containing the ankyrin and PARP domains of a 
putative mouse Tankyrase II, respectivery. 

Example 2: Cloning of the N-terminus of Tankyrase II. 

To extend the ANK EST contig, Rapid Amplication of cDNA Ends (RACE™) (Gibco-BRL, # 18374- 
058) was performed using the primers tanklt-2 and tankll-3 and (0 poly A+ RNA from BJ fibroblasts transduced 
with a retroviral vector expressing the human TERT gene (pBABE-TERT) and (ii) the Marathon-Ready testis 
cDNA (Clontech, #7414-1). This was followed by nested amplification using primers Ltankll-1 and Ltankll-2, 
respectively. The products from these amplifications were cloned into pCR2.1-TOPO (InVitrogen, # 45- 
0641). Four clones were identified by PCR with primers (AAP [Gibco] or AP-1 [Clonetech]) to the vector and 
the ANK EST contig (Ltankll-1 or Ltankll-2), termed inside-out PCR, to contain additional DNA 5* to the EST 
contig. Custom primers were designed based on the evolving sequence data. Subsequent sequence analysis 
indicated that only two, designated MP9 and MP12, contained authentic Tankyrase II sequences. 'Inside-out" 
PCR is the term used to describe amplification using a primer pair In which one primer is from the target gene 
(e.g., Tankyrase II) and the second primer is specific for the vector. 

Additional N-terminally extended clones were isolated by the GeneTrapper (Gibco-BRL, #10358-020) 
cDNA done enrichment procedure using the oligonucleotides Ltankll-4B, Ltankll-5B. and Ltankll-6B and 
plasmld cDNA libraries from liver and spleen (Gibco-BRL, #10422-012 and 10425-015. respectively). 
Approximately 100 GeneTrapper clones were screened by colony hybridization with a PCR probe (described 
infra) from the ANK done AA244138 and by inside-out PCR (primers: SP6/tankll-2 and SP6/tankll-3) to identify 
four dones (S10, S25, S34, and L1 1) that contained additional DNA 5' to the EST contig. 

Sequence analysis of the 2 RACE and 4 GeneTrapper clones formed a contig that extended 
approximately 200 bp downstream and approximately 1100 bp upstream of the original ANK EST contig. The 
5' most sequence terminated in DNA homologous to the most N-terminal tankyrase ANK repeat just after the 
HPS domain. 
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Example 3: Identification of X bacteriophage clones of Tankvrase II. 

Three X bacteriophage human cDNA libraries, XgtIO thymus (Clontech, HL1074a), Xgtll 293 human 
embryonic kidney cancer cell line, and XTriplx Testis (Clontech. cat # HL50331), were screened by plaque 
hybridization with a probe from the ANK done AA244138. The probe was generated by PCR using the primers 
5 Utankll-5 and Ltanklt-7. Twenty-six phage were positively Identified through secondary and tertiary plaque 
hybridizations. Using PCR the presence of the ANK EST contig was confirmed in all 26 phage. Additional PCR 
was used to identify one phage (X11L-1-1) from the 293 library that contained the most N-terminal ankyrin 
repeat and the ANK. SAM, and PARP contigs. The X1 1L-1-1 insert is believed to contain the entire Tankyrase (I 
coding sequence. Two other phage X11-L-1-3. X11L-1-4 (293) from the 293 library were identified that 
75 10 contained the most N -terminal ankyrin repeats and the original ANK contig. Inside-out PCR (primers gt10- 

57l_tankll-31 or gt11-57Ltankll-31), showed these clones contained up to 800 bp of additional Tankyrase II 
sequence upstream of the most N-terminal ankyrin repeat. These vector/N-terminal insert PCR products and 
PCR products from these four phage that linked the SAM and ANK contigs and the SAM and PARP contigs 
were sequenced directly. 

20 15 The XI 1L-1-1. X1 1L-1-3, or X1 1L-1-4 can be individually characterized by the following tests. Phage 

X11L-1-1 contains DNA that can be amplified with the primer pairs Utankll-5/UanklM6, Utankll-5/Ltankll-9, and 
Utankll-3/LtanklMO (Table 4). Phage X11L-1-3 contains DNA that can be amplified with the primer pair 
UtankII-5/LtanklM6, but not with the primer pairs Utankll-5/Ltankll-9 and Utankll-3/Ltank!l-10 (Table 4). Phage 
X11L-1-4 contains DNA that can be amplified with the primer pairs Utankll-5/LtanklM6, Utankll-5/Ltankll-9, but 

25 

20 not with the primer pair Utankli-3/Ltankll-10 (Table 4). Additional sequence was obtained by amplifying phage 
X11L-1-4 DNA using UTankli-3 and LTankl Ml primers (PARP/SAM spanning sequence). 

Example 4: Tankvrase II arnino-termtnal domain sequence 
30 More than 200 GeneTrapper clones obtained using UTankll-4B. UTankll-5B and UTankll-6B 

25 oligonucleotides were probed by colony hybridization with a ^P-labeled PCR fragment from clone MP9 {supra) 
using primers UTank2-30 and LTank2-1. Of the positive clones Identified, sequence was obtained from 5 
independent clones from two different cDNA libraries (Superscript human liver cDNA and Superscript human 
spleen cDNA libraries [Clonetech]). The clones are: S4.66, S4.21, S6.7, S6.91, L5.4. Of these L5.4 had the 
35 longest 5* sequence. L5.4 was deposited with the ATCC (American Type Culture Collection, 10801 University 

30 Boulevard. Manassas, VA) and assigned Accession No. 203919. The DNA encoding the amino-terminal region 
of the Tankyrase 11 polypeptide is extremely GC-rich (>80% in the sequence). 

Figures 2 and 3 show cDNA and protein sequence data obtained for Tankyrase II. 
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TABLE 4: Primers for amplifying Tankyrase II sequences 



Forward & Reverse Primer 
Designation 



Nucleotide Sequence SEQ. ID NO: 



UTANKII-1 GTT ACA TTT GCC ACA GGC AG 24 

UTANKII-2 GTC TTT CTT GCA GTT CAG TG 25 

UTANKIl-3 GAG TCG AGA GAC TTA TCT CC 26 

UTANKIWA GAG CAC AGA GAT GGA GGT C 27 

UTANKIWB ATGTACAGC AACTCCTCCAAGA 28 
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5 


TAQI p A- Prl 


mers tot amplifying t an Ky rase it sequences 






Forward & Reverse Primer 
Designation 


Nucleotide Sequence 


SEQ. ID NO: 




UTANKH-5A 


CAG ACA ATT GCT GGA AGC TG 


29 


10 


UTANKJI-5B 


CAG ACA ATT GCT GGA AGC TGC A 


30 




ITTANK1I-6A 


CTA CTC CTG AGC TAT GGG TG 


31 




UTANKII-6B 


GTG TAC TGT TCA GAG TGT CAA C 


32 




LTANKII-7 


CCA TGC TGG AGC AGA AGT TTG 


33 


15 


LTANKII-8 


GCT AAA ATC TCT CCT GGA ACC 


34 


LTANKII-9 


GTT TGT GCC TAT GTC CAT AAG C 


35 




LTANKil-10 


CAA AAG AGC AGC TGC CTG TG 


36 




LTANK1M1 


CTG CAG GAA AGA CTT TCC CAA G 


37 




UTANKIM2 


GCA GCC AGT GGC CCT CTA CG 


38 


20 


UTANKIM3 


GCC CCA CAG GCC TGT GGC C 


39 




UTANKIH4 


GAA ACT AAT TCC CAC TAA CC 


40 




LTANKIM5 


AAT AAA TAC TGG GCT AGT AC 


41 




LTANKIH6 


AGG GTC TGC ACC ATG CTG GAG C 


42 


25 


LTANKIM7 


ATA AAT CAG CTA CAT TAA CTA C 


43 


LTANKII-18 


CCC AGC TGC AAA ATG AAG T 


44 




LTANKIM9 


AAT GAC TCT GCA GTT GAC AC 


45 




UTANKII-20 


GAT ACA CTC ACC GGA GAA AAG 


46 




LTANKI1-21 


GTG AAC TGG ACA CCC AGT ACC 


47 


30 


UTANKll-22 


GGT ATG GTC GAT GGA TAA ATA G 


48 




LTANKII-23 


GAA CAC AGT ATT GTA TTA G 


49 




1 ITAklU") Irt 

U I 


LGG CGG GCA GGA AAT CCA CC 


50 




LTANK2-31 


TTG GGG TCT GCA CCA TGT CG 


51 


35 


UTANK2-32 


TCC AGA GGC TGG TGA CCC CTG A 


52 


LTANKII-33 


TCT GCT AAA TCC AAT GCT GTC C 


53 




LTANKII-34 


TGC AGC GGG GTG GAT TTC CT 


54 




LTANKII-35 


CAT TTT GAA GCA AAT ATT TA 


55 




LTANKII-36 


GGA ATA AGG CCC CCA TTA TA 


56 


40 


LTANKII-35 


CAT TTT GAA GCA AAT ATT TA 


57 




LTANKII-36 


GGA ATA AGG CCC CCA TTA TA 


58 



Example 5: Northern hybridization of Tankyrase II mRNA 
45 A Northern blot (Human Multiple Tissue Northern (MTN) TM Blot™ (obtained from Clontech, Cat 

5 #7780-1) was hybridized with a 3'UTR probe at 2 x 10° cptn/ml hybridization solution. The 3* UTR fragment 
was amplified by PCR with UTank2-14 and LTank2-15 primers using the Est done n57467 as a template. 

The Northern analysis showed that the Tankyrase II transcript is about 6 to 7.5 kb in length, and is 
expressed In most tissues, including brain, heart, colon, thymus, spleen, kidney, liver, small intestine, lung, and 

50 
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5 peripheral blood leukocytes. It appears to be particularly abundant in skeletal muscle and placenta. *BJ RNA" 

is polyadenylated RNA isolated from a human fibroblast ceil line designated BJ. 

Example 6: Plasmid clones of the Tankvrase II cDNA 
5 The isolated Tankyrase II cDNA bacteriophage clones were transferred to plasmid vectors as follows: 

10 1. The Tankyrase II cDNA contained in bacteriophage X11L-1 was removed as a BsiWI fragment and 

inserted into the Acc65 I site of pBluescrtpt II SK+ (Stratagene) (designated pGRN509). 
2. The Tankyrase tl cDNA contained in bacteriophage M1L-3 was removed as a BsiWI fragment and 
inserted into the Acc65 I site of pBtuescript II SK+ (Stratagene) (pGRN510). 
10 3. The Tankyrase II cDNA contained In bacteriophage X1 1L-4 was removed as a BsiWI fragment and 

* 5 inserted into the Acc65 I site of pBluescrtpt II SK+ (Stratagene) (pGRN511). 

The PARP domain from pGRN509 was amplified with the primer hParpt (5'-CC ATCGAT 
GCCAGCCATG GAG GTT CCA GGA GTA GAT-3'; SEQ. ID NO:59) and primer hParp2 (5 - GCTCT AGA TCA 
GGC CTC ATA ATC TGG-3'; SEQ. ID NOSO) using PFUfTaq polymerase mixture. The resulting fragment was 
15 TA cloned into the InVitrogen TA cloning vector pCR2.1-TOPO®. A clone (pGRN513) was selected with the 
sense strand downstream of the T7 promoter. The primer hParpI introduces an ATG and kozak consensus 
sequence at the 5* end of the PARP 

To assemble a full length cDNA containing the Tankyrase II ORF fragments from pGRN51 1 and 
pGRN509 were combined as follows: The Not 1 fragment of pGRN511 was inserted into the Not I site of 
25 20 pBtuescript II KS+ (Stratagene) (pGRN512). pGRN512 was digested with Nhe I and Cla I and the larger 

vector/Tankyrase II cDNA fragment was Isolated, the -2.1 Kbp Nhe I - Cla 1 fragment of pGRN509 was ligated 
to this fragment to generate a clone containing the full length Tankyrase II cDNA ORF (pGRN514). 

Example 7: Further Sequence Data for the Tankvrase II cDNA 
30 25 The plasmids pGRN509 and pGRN511 were sequenced with an ABI 377 automated DNA sequencer 

by standard techniques using primers complementary to the insert sequences. 

Figure 4 shows the revised cDNA sequence (SEQ. ID NO:5), and the revised amino acid translation 
(SEQ. ID NO:6). The translated protein product is presumed to begin at the Met encoded at position 224 of the 
cDNA sequence, and ending at position 3721. The Met is assigned position No. 1 for purposes of numbering 
35 30 the amino acid translation. However, the upstream polynucleotide sequence shown contains no stop codon, 

and the translation starting Met may be further upslream from the Insert shown in the figure. 

The number of amino acids in Tankyrase II corresponding to nucleotides 284 to 3721 is 1166 amino 
acids long. The calculated molecular weight is 126.8 kDa, and the calculated isoelectric point is 6.78. 
^ Figure 1 shows the location of the functional domains in the Tankyrase II sequence. The position of 

35 each domain within the sequence is shown in Table 5. 



TABLE 5: Location of Functional Domains in Tankyrase II 



50 



Domain 


Position 


GC 


1 to 22 


ANK 


23 to 859 


SAM 


870 to 935 


PARP 


1023 to 1161 
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5 

Figure 5 and Figure 6 compare Tankyrase II (SEQ. ID NO:6) with its closest known intraspecles 
homotog. Tankyrase I (SEQ. ID NOs:8), at the protein level, and at the cDNA level. 

The degree of sequence identity of Tankyrase II relative Tankyrase I was determined in this example 
5 by dividing the two proteins into their functional domains. Identity was then calculated by dividing the number 
/ 0 0 f matched residues by the number of matched and mismatched residues over each area, scoring half a point 

for each unmatched residue occurring in a gap or overhang on either side. Results were as follows: 

• N^erminus: 7 matches/ 7 matches + 20 mismatches + (79 gaps/2) = 7% 

• Ankyrin repeats: 720/ 720+1 04 +<7/2) = 87% 
10 ■ Inter domain 1: 7/7+€+(2/2)= 50% 

15 • SAM domain: 54/54+12=82% 

• Inter domain #2: 82/82+15=83% 

• PARP domain: 132/132+7=95% 

• C-terminus: 0/5+<B/2)=0% 
15 • Overall: 992/1241.5=79.9% 

• Overall (discounting N- and C-termini): 985/1 1 33.5= 86.9% 
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Example 8: Testing for PARP Activity 

To produce a Tankyrase II peptide comprising the PARP domain, pGRN513 was transcribed and 

2 5 20 translated in a 20 fold scale up in vitro coupled transcription/translation (TnT) reaction. Full-length Tankyrase II 

peptides can be obtained by similar procedures, using plasmlds designated pGRN514 or pGRN323. 

Each plasmid was set up as follows, paired with a reaction in which the plasmid DNA was omitted as 
an unprogrammed control. The reaction mixture contained 20 ug of circular plasmid pGRN513 or pGRN514 or 
pGRN523; 500 ul rabbit reticulocyte rysate; 1x TnT Buffer; 20 ul T7 RNA polymerase; 20 pl 1 mM complete 

30 25 amino acids; 20 u.1 RNAguard™; and 6H& to 1 ml total volume. The reactions were incubated for 90 minutes 

at 30°C, then pooled and made 50% with ammonium sulfate. The resulting pellets were washed with 50% 
ammonium sulfate and resuspended in either 400ul PARP buffer A (50mM Tris-HCL pH 8. 4mM MgCh. 0.2 mM 
DTT, 50 mM NaCl, 10mM G-mercaptoethanol. 1 mM PMSF) for the Tankyrase I! reactions, or in 100 pi PARP 
buffer A for the unprogrammed control. The TnT resuspended r/sate was then dialysed overnight against two 

35 

30 changes of PARP buffer A to remove traces of ammonium sulfate. 

The following assays were performed to determine PARP activity: 

1 . ^S-labelled Tankyrase without NAD* 

2. -1 50ng Tankyrase II PARP domain, with NAD* 

4Q 3. ~150ng Tankyrase II PARP domain, with NAD*, 1.5ug TRF1, 5 ug Histones 

35 4. ~150ng Tankyrase II PARP domain, with NAD*. 1.5ug TRF1 . 5 ug Histones 

5. -150ng Tankyrase II PARP domain, with NAD*, 1.5ug TRF1, 5 ug Histones and 1.6 mM 3- 
aminobenzamide 

6. Unprogrammed lysate with NAD. 1 .5ug TRF 1 , 5ug Histones 

45 7. PARP control enzyme (Trevigen cat#4667-5O-01), with NAD*. TRF1 , Histones 

40 For Reaction 1. Tankyrase II was biosynthetically tabeled using ["SJmethionine as a molecular weight 

marker of the non-ribosylated form. The reaction mixture comprised 1 ug of circular plasmid pGRN513 or 
pGRN514 or pGRN523: 25 ul rabbit reticulocyte lysate; 1x TnT Buffer. 1 ul T7 RNA polymerase; 1 uMmM 

50 
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methionine; 2 u.1 r®SJ Met (1000Ci/mmol); 1 uJ RNAguard™; and dH20 to 50 pi total volume. The product was 
precipitated with 50% ammonium sulfate but not dialysed. 

Reactions 2 to 6 were conducted under the foDowing assay conditions: 1x PARP enzyme buffer 
(Trevigen CaW4667-50-02), 40 uM r°P]NAD* (50 pCi). Reactions 2.3 5-7 with TCA were precipitated in 20% 
5 TCA. pellets were washed sequentially with 5% TCA. 90% acetone/1 N HCI and 100% acetone. PeDets were 
then resuspended In 40 uL protein loading buffer and heated for 10 minutes at 80°C. 

Reaction 4 was immunoprecipitated with 10 uL Anti-poly (ADP-ribose) monodonai antibody (Trevigen 
Cat* 4335-MC-100) and 10 uL Anti-poly (ADP-ribose) polyclonal antibody (Biomol Cat# SA-276). Reaction 
volume was incubated on ice for 30 minutes in the presence of the antibodies. 40 uL Protein A slurry was 
10 incubated by rotating at room temperature for 2 hours. Beads were washed twice with PARP buffer A 
containing 50 mM NaCI. and once in PARP buffer A containing 450 mM NaCI. Beads were boiled in 40 pL 
protein loading buffer. Reaction 7 was performed according to manufacturers directions (Trevigen). Samples 
from these reaction mixtures were then analysed on a 12% SDS-PAGE gel. The gel was dried and exposed to 
a phosphorimager screen and imaged. Preliminary results of these assays have been inconclusive 

20 15 

Example 9: Chromosomal location of th e Tankvrase II ofine 

The Tankyrase II gene was localized to chromosome 10q by radiation hybrid mapping (Boehnke et al., 
Am J. Hum Genet 49:1174, 1991: Walter et al.. Nature Genet 7:22) using the medium resolution Stanford G3 
panel of 83 RH clones of the whole genome (created at the Stanford Human Genome Center). A human 
25 20 rymphoblastoid cell line (donor rM) was exposed to 10.000 rad of X-rays and was then fused with 

non-irradiated hamster recipient cells (A3). Eighty-three independent somatic cell hybrid clones were Isolated, 
and each represents a fusion event between an irradiated donor cell and a recipient hamster cell. The panel of 
G3 DNA was used for ordering markers in the region of interest as well as establishing the distance between 
these markers. 

30 25 The primers used for RH mapping were UTANKII-20 and LTANKI1-21 (Table 4). The 83 pools were 

amplified independently and 13 (16%) scored positive for Tankyrase II. The amplification results were 
submitted to the Stanford RH server, which then provided the map location, 10q23.3, and the closest marker, 
STS D10S536. 

35 

30 Example 10: Transcription and Translation of Tankvrase II PARP domain 

PGRN513 was tested by in vitro transcription/translation (TnT) to confirm it encoded the appropriate 
sized protein. The PARP domain is expected to run at approximately 35 kDa, as determined by SDS 
polyacrylamide gel electrophoresis. TnT reactions were set up for pGRN513 and pGRN125 (hTERT as a 
40 positive control for the TnT reaction) with the following components: 

35 • 1 jig of circular plasmid 

• 25 pi rabbit reticulocyte tysate 

• 1 x TNT Buffer (Promega Cat # L4810) 
45 • 1 u!T7 RNA polymerase 

• 1 pi 1mM methionine 

40 • 2 pi [^S] methionine (1000 Ci /mmol) (Amersham Cat # SJ 1015, 1000 Ci/mmol) 

• 1 pi RNAguard (Pharmacia Cat # 27-0815-01) 
50 • dH20 to 50 pi total volume 
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The reaction was incubated (or 90 minutes at 30°C. 40 uJ of the TnT reactions were precipitated with 
50% ammonium sulfate and resuspended in 40 u.1 of buffer (20mM HEPES-KOH pH 7.9. 2 mM MgCfe. 1mM 
EGTA, 10% glycerol. 0.1% Nonidet P-40. 0.1mM phenytmethylsulphonyl nuoride) /lOOmM NaCI. 5 ul of TnT 
reaction, 5 ul of ammonium sulfate cut TnT reaction and 5 uL of the ammonium sulfate cut was analyzed on a 
12% SDS-PAGE. pGRN513 generated the expected size fragment. 

Example 1 1 : Antibodies to Tankvrase II 

Peptides are prepared on a synthesizer for use as immunogens based on the sequence data shown 
in Table 6. 



TABLE 6: Peptide Immunogens 


Laboratory Designation 


Sequence 


Specificity 


SEQ. ID NO: 


GCJT-1 
GCJT-2 
GCJT-3 

QCJT-4 


MAASflRSQC 
MSGRRCAGK 
QBGJSLGNSEADRQC 

GEYKKDELLEC 


residues 1-8 of Tankyrase I 

residues 1-B of Tankyrase II 

residues 481-494 of Tankyrase H 

residues 269-276 of Tankyrase ll; 
common to both proteins 


61 
62 
63 

64 



Underlined residues do not belong to the native sequence of the proteins but are added to the peptides in order 
to couple them to carriers for antibody production. 

15 BIOLOGICAL DEPOSIT 

30 

Phage X11L-1-1, X11L-1-3. X11L-1-4 were deposited as a mixture with the ATCC (American Type 
Culture Collection, 10801 University Boulevard, Manassas, VA ) on April 12, 1999, under Accession No. 
203919. 

20 Phage are stored in a buffer of 5.8 g NaCI. 2 g MgS0 4 &H 2 0, 50 mL 1 M Tris-HCI pH 7.5, and 0.01% 

gelatin (Difco). To isolate each phage, test individual phage are separated by plaque purification by standard 
PCR amplification. Tankyrase II sequence in phage M1L-1-1 can be amplified with the primer pairs Utanktl- 
5/Ltankll-16, Utankll-5/Ltankll-9, and Utankll-3/Ltankll-10 (Table 4, supra). Tankyrase II sequence in phage 
X11L-1-3 can be amplified with the primer pair UtankH-5/Ltankll-16. but not with the primer pairs Utankll- 
40 25 5/Ltankll-9 and Utankll-3/Ltankll-10. Tankyrase II sequence in phage X11L-1-4 contains DNA that can be 

amplified with the primer pairs Utankll-5/Ltankll-16, Utankll-5/Ltankll-9, but not with the primer pair Utankll- 
3/LtanklM0. 
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TABLE 7: Sequences Listed in this Disclosure 



10 



15 



20 



SEQ. ID NO: 


Subject 


Reference 


1 


Human Tankyrase II DNA sequence 


FIGURE 2. this Invention. (607128,577) 


2 


Human Tankyrase II protein sequence 


FIGURE 2, this Invention. (60/128.577) 


3 


Human Tankyrase II DNA sequence 


FIGURE 3, this Invention. (60/129.123) 


4 


Human Tankyrase II protein sequence 


FIGURE 3. this Invention. (60/129.123) 


5 


Human Tankyrase II DNA sequence 


FIGURE 4, this Invention. 


6 


Human Tankyrase II protein sequence 


FIGURE 4. this Invention. 


7 


Human Tankyrase I DNA sequence 


GenBank Accession No. AF082556 
Smith et al. Science 282:1484 (1998) 


8 


Human Tankyrase I protein sequence 


GenBank Accession No. AF082556 
Smith et al. Science 282:1484 (1998) 


9 


Human cDNA clone similar to 
Ankyrin G119 mRNA 


GenBank Accession No. AA244138 
R. Strausberg (unpublished) 


10 


Human cDNA done similar to Ankyrin- 
related ADP-ribose polymerase mRNA 


GenBank Accession No. AW1 57349 
L. HUUer et al. (unpublished) 



TABLE 8: Additional Sequence Data 
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GenBank Accession 
No. 


Subject 


Reference 


U40705 


TRF1 sequence 


Chong et al., Science 270:1663 (1995) 


AF002999 


TRF2 sequence 


Broccoli et al., Nature Genet. 17: 231 (1997) 


AF 19551 2 


T1N2 sequence 


Kim et al.. Nat Genet. 23 :405 (1999) 


• For purposes of prosecution in the U.S.A., the DNA and encoded amino acid sequences listed in this 
are hereby incorporated herein by reference. 
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CLAIMS 

What is clamed as the invention Is: 

1 . An Isolated polynucleotide having at least one of the following properties: 

a) It comprises a sequence of at least 30 consecutive nucleotides that is contained in SEQ. ID 
NO:5, but not In SEQ. ID NO:7; 

b) it comprises a sequence of at least 30 consecutive nucleotides that is contained in plasmids 
deposited under Accession No. 203919. but not in X-phage or SEQ. ID NO:7; 

c) it comprises a sequence of at least 100 consecutive nucleotides that is at least 90% identical to 
a sequence contained in SEQ. ID NO:5, but not in SEQ. ID NO:7; 

d) it comprises a sequence of at least 100 consecutive nucleotides that is at least 90% identical to 
a sequence contained in plasmids deposited under Accession No. 203919, but not in X-phage or SEQ. 
ID NO:7; or 

e) it comprises a sequence of at least 30 consecutive nucleotides that hybridizes under stringent 
conditions to a polynucleotide with the sequence in SEQ. ID NO:1, but not to a polynucleotide with the 
sequence in SEQ. ID NO:7. 

2. The isolated polynucleotide of claim 1 , which encodes a protein comprising a GC domain, a PARP 
domain, a SAM domain, or an ANK domain. 

3. The isolated polynucleotide of claim 1 or claim 2, which encodes a protein having at least one of the 
following properties: 

a) it binds to any of the proteins TRF1 , TRF2, TIN2, and Tankyrase I; or 

b) it ADP-ribosylates a target protein in the presence of nicotinamide adenine dinucleotide (NAD*). 

4. An isolated polypeptide having at least one of the following properties: 

a) it comprises a sequence of at least 10 consecutive amino acids that is contained in SEQ. ID 
NO:6, but not in SEQ. ID NO:8; 

b) it comprises a sequence of at least 10 consecutive amino acids that is contained in plasmids 
deposited under Accession No. 203919. but not in X-phage or SEQ. ID NO:8; 

c) it comprises a sequence of at least 25 consecutive amino acids that is at least 90% identical to a 
sequence encoded within SEQ. ID NO:6. but not In SEQ. ID NO:8; or 

d) it comprises a sequence of at least 25 consecutive nucleotides that is at least 90% identical to a 
sequence encoded within plasmids deposited under Accession No. 2D3919, but not in X-phage or SEQ. 
ID NO:8. 

5. The isolated polypeptide of claim 5, comprising a GC domain, a PARP domain, a SAM domain, or an 
ANK domain. 

6. Tne isolated polypeptide of claim 4 or daim 5, having at least one of the following properties: 

a) it binds to any of the proteins TRF1 , TRF2. TIN2, and Tankyrase I; or 

b) It ADP-ribosylates a target protein in the presence of nicotinamide adenine dinucleotide (NAD*). 
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7. Isolated human Tankyrase II protein, at least 10-fold more pure on a weight per weight basis than 
occurs in natural sources. 

8. An isolated polynucleotide encoding the polypeptide of any of daims 4-7. 

9. An isolated polyclonal antibody or a monoclonal antibody that binds specifically to a polypeptide with the 
sequence SEQ. ID NO:6 but not to a polypeptide wtth the sequence In SEQ. ID NO: 8. 



10. The antibody of claim 9, which inhibits the catalytic activity of Tankyrase II; inhibits the binding of 
10 Tankyrase II to any of the proteins TRF1, TRF2. TIN2, or Tankyrase I; or Inhibits protein ribosylation 

15 mediated by Tankyrase II in the presence of NAD*. 

11. A method for obtaining an antibody according to claim 9 or 10, comprising immunizing an animal or 
contacting an immunocompetent particle with a polypeptide according to any of daims 4-7. 

15 

20 12. A method for isolating a peptide according to any of claims 4-7 from a protein mixture, comprising 

incubating the mixture with an antibody according to daim 9 or 10 under conditions that permit specific 
binding of the peptide to the antibody to form a complex, separating antibody-peptide complexes from 
the mixture, and recovering the peptide from such complexes. 



25 



13. A method for obtaining a peptide according to any of claims 4-7, comprising expressing a polynudeotide 
of any of daims 1 -3 and daim 8 in a host cell. 



14. A method for ribosylating a target protein, comprising incubating the target protein with a peptide 
3 0 25 according to any of daims 4-7 in the presence of NAD*. 

15. An assay method for determining Tankyrase II binding activity in a sample, comprising incubating a 
sample suspected of containing a protein having Tankyrase II binding activity with a peptide according 
to any of claims 4-7 under conditions where the protein can bind the peptide specifically to form a 

35 30 complex, and then correlating any complex formed with the presence or amount of the protein in the 

sample. 

16. The assay method of daim 15, wherein the protein that has Tankyrase II binding activity is selected from 
the group consisting of TRF1. TRF2, T1N2, and Tankyrase I. 



40 35 



17. A method of screening a test compound for an ability to affect Tankyrase II activity, comprising 
incubating a readion mixture containing a peptide according to any of daims 4-7, a protein that has 
Tankyrase II binding activity, and the test compound under conditions where the protein could bind the 
peptide to form a complex in the absence of the test compound; and determining any effect of the test 



45 

40 compound on the amount or rate of complex formation. 
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18. A method of screening a test compound for an ability to affect Tankyrase II activity, comprising 
incubating a reaction mixture containing a peptide according to any of claims 4-7, a target protein, 
nicotinamide adenine dinudeoDde (NAD*), and the test compound, under conditions where the target 
protein would be ribosytated by the peptide in the absence of the test compound; and determining any 
effect of the test compound on the amount or rate of ribosylation. 



10 



19. A method for modulating Tankyrase II expression in a cell, comprising contacting the cell with the 
polynucleotide of any of claims 1-3 which is an antisense polynucleotide, a ribozyme. or interfering RNA 
(RNAf) under conditions where the polynucleotide inhibits translation of mRNA into Tankyrase II. 

10 

15 20. A method for modulating telomere length in a cell, comprising modulating the expression of Tankyrase II 

in the ceO. 
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Figure 1 
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Figure 2(1) 
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Figure 2(2) 
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Figure 2(3) 
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Figure 2(4) 
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Figure 2(5) 
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Figure 2(6) 
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CATTCCTTTGTAAT ATAA A 1 V. i. l» * « * 1A SAACTCCAAAAAAAAAAAAAAAAAA 
4441 * * * * «. 4493 
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CCCACGCGTCCGGGCAGGAGGGG CCTTGCCAGCTTCCG 



HASGQEGPCQLPPPRRFRT 



137 



13B + + «. — + * * - 197 

RTADS RCLRRRGAAGGQGAH 
CGAANGGGCGCGCGTGGGCGCGGCCATGGGACTGCGCCGGATCCGGTGACAGCAGGGAGC 



198 



257 



R7GARGRGHGTAPDPVTAGS 



258 — ► * + ♦ ♦ - 317 

QAARALSASSPGGLALLLAG 
CCGGGGCTCCTGCTCCGGTTGCTGGCGCTGTTGCTGGCTGTGGCGGCGGCCAI^GATCATG 



318 



377 



378 



PGLLLRLLALLLAVAAA? I M 
TCGGGTCGCCGCTGCGCCGGCGGGGGANCGGCCTGCGCGANCGCCGCGGCCGAAGCCGTG 



437 



A C A 



A A A E A V 



SGRRCAGGG 

GAJCCGGCCGCCCGAAANCTGTTCGAAGCGTGCCGCAACGGGGACGTGGAACGANTCAAG 
438 —J * + + + + 497 



498 



558 



618 



678 



E|FAAR?LFEACRNGDVER?K 

AAGCTGGTGACNCCTGARAAGGTGAACAGCCGCGACACNGCGGGCAGGAAATCCACCCCG 

KLVTPEKVNSRDTAGRKSTP 
CTGCACTTYCCCGCANGTTTTGGGCGGAAAGACTTAOTTRAATATTTGCTTCANAATGGT 

LHFPA7FGRKDL? ? Y L L ? N G 
GCAAATGTYCAANCACGTGATOATGGGGGCCTTATTCCTCTTCATAATGCATGCTCTTTT 

ANVQ?RD?GGLIPLHNACSF 
GGTafTGCTRAAANTATCNATCTCCTTTTGCNACATOGTGCAI^AMCCCAATGCTCGAGAT 

G?A??I?LLL?H?A?PNARD 
AATTGGAATTATACTCCTCNCKATGAAGCTGCAATTAAAGGAAAGATTGAIvT^nTGCATT 



557 



617 



677 



737 



738 



797 



W N Y T P ? 



E A A 



K G K I 



C I 
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GTGCTGTTACAGCATGGAGCTGAGCCAACCATCCGAAATACAGATGGAAGGACAGCATTG 
79B ~+ «. — + ♦ + ♦ g 57 

V L L Q K G , % A EPTIRNTDGRTAL 

G ATTTAGCAGATC CATCTGCCAAAGCAGTGCTTACTGGTG AATAT AAGAAAGATG AACTC 
85B — + + + + + + o 17 

DLADPSAKAVLTGEYKKDEL 

TTAGAAAGTGCCAGGAGTGGCAATGAAGAAAAAATGATGGCT 
918 + + + ♦ + + 977 

LESARSGNEEKMMALLTPLN 

GTCAACTGCCACGCAAGTGATGGCAGAAAGTCAACTCCATTACATTTGGCAGCAGGATAT 
978 — + + + + + + 1037 

VNCHASDGRKSTPLHLAAGY 

AACAGAGTAAAGATTGTACAGCTGTTACTGCAACATGGAGCTGATGTCCATGCTAAAGAT 
1038 — + — + + ♦ + — - 1097 

NRVKIVQLLLQHGADVHAKD 

AAAGGTGATCTGGTACCATTACACAATGCCTGTTCTTATGGTCATTATGAAGTAACTGAA 
1098 — + -* + + + 1157 

KGDLVPLHNACSYGHYEVTE 

CTTTTGGTCAAGCATGGTGCCTGTGTAAATGCAATGGACTTGTGGCAATTCACTCCrc 
1158 — +- + + + + 1217 

LLVKHGACVNAMDLWQFTPL 

CATGAGGCAGCTTCTAAGAACAGGGTTGAAGTATGTTCTCTTCTCTTAAGTTATGGTGCA 
1218 + + + + + 1277 

HEAASKNRVEVCS LLLSYGA 

GACCCAACACTGCTCAATTGTCACAATAAAAGTGCTATAGACTTGGCTCCCACACCACAG 
12 78 + ♦ ♦ + + + 1337 

DPTLLNCHNKSAIDLAPTPQ 

TTAAAAGAAAGATTAGCATATGAATTTAAAGGCCACTCGTTGCTGCAAGCTGCACGAGAA 
1338 - — + * + ♦ + 1397 

LKERLAYEFKGHSLLQAARE 

GCTGATGTTACTCGAATCAAAAAACATCTCTCTCTGGAAATGGTGAATTTCAAGCA 
1398 ~* + ♦ ♦ — 1457 

ADVTRIKKHLSLEMVNFKHP 

CAAACACATGAAACAGCATTGCATTGTGCTGCTGCATCTCCATATCCCAAAAGAAAGCAA 
1458 __ + + + ♦ + * 1517 

QTHETALHCAAAS PYPKRKQ 

AT AT G TG AACTG TTG CT AAG AAAAG G AGCAAAC ATC AATG AAAAG AC T AAAGAATTC TTG 
1518 — — +— +— + + 1577 

ICELLLRKGANINEKTKEFL 
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ACTCCTCTGCACGTGGCATCTGAGAAAGCTCATAATGATGTTGTTGAAGTAGTGGTGA/iA 
15 78 +— ♦ ♦ * 1537 

TPLHVASEKAHNDVVEVVVK 

C ATG AAG CAAAGG TT AATGCTCTGG ATAATCTTGGTC AG ACTTCTCT ACAC AG AG C TGC A 
1638 + ♦ * — +- *- 1697 

HEAKVNALDNLGOTSLHRAA 

T ATTG TGG TCATCT AC AAACCTG CCG CCTACTC CTG AGCT ATGGG TGTG AT CCT AAC ATT 
1698 + ♦ * 1757 

YCGHLQTCRLLLSYGCDPNI 

ATATCCC TTCAGGGCTTTACTGCTTTACAG ATGGG AAATGAAAATGT AC AGCAAC TCCTC 
1758 + + ♦ + + 1817 

I SLQGFTALQMGNENVQQLL 

CAAGAGGGTATCTCATTAGGTAATTCAGAGGCAGACAGACAATTGCTGGAAGCTGCAAAG 
1818 ♦ + ♦ + + 1877 

QEGI SLGNSEADRQLLEAAK 

GCTGGAGATGTCGAAACTGTAAAAAAACTGTGTACTGTTCAGAGTGTCAACTGCAGAGAC 
1878 * + * + ♦ 1937 

AGDVETVKKLCTVQSVNCRD 

ATTGAAGGGCGTCAGTCTACACCACTTCATTTTGCAGCTGGGTATAACAGAGTGTCCGTG 
1938 — + +- * + * * 1997 

IEGRQSTPLHFAAGYNRVSV 

GTGGAATATCTGCTACAGCATGGAGCTGATGTGCATGCTAAAGATAAAGGNGGCCTTGTA 
1998 ~* + + ♦ ♦ ♦ 2057 

VEYLLQHGADVHAKDKGGLV 

CCTTTG C ACAATG CATGTT1TTTATGG ACATT ATG AAGTTG CAGAACTTCTTG T TAAACAT 
2058 -* + + + * 2117 

PLHNAC?YGHYEVAEL LVKH 

GGAGCAGTAGTTAATGTAGCTGATTTATGGAAATTTACACCTTTACATGAAGCAGCAGCA 
2118 -* * ♦ — * + 2177 

GAVVNVADLWKFTPLHEAAA 

AAAGGAAAATATGAAATTTGCAAACTTCTGCTCCAGCATGGTGCAGACCCTACAAAAAAA 
217B — +™ + + * «■ 2237 

KGKYEICKLLLQHGADPTKK 

AAAAAAAAAGGAAATJATTanTTGGATCITGTTAAAGATGGAGANACAGATATTCAAGAT 
2238 — «■ -+ + + 2297 

KKKG?I?I*DLVKDG?TDIQD 

NTGCTTAGGGGAGATGCAGTTTT G TTAGATGCTGCCAAGAAGGGTTGTTTAGCCr.GAGTG 
2298 — * + ♦ ♦ 2357 

? LRGDAVLLDAAKKGCLARV 
AAGAAGTT^WITTTCCTGATAATCTAA^ 
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«. ♦ ♦ + + + 2417 

KK? ? FPDNVNCRDTQGRHST 

CCTTTACATTTAGCAGGTNNNNOTttJN^^ 

+ + + ♦ * 2477 

PLHLAG????????? ????? 

NNNNNNNttJNNNNNNNNNNNN^ 

+ + ♦ ♦ + 2537 

???????????????????? 

NNNl^^NNNNNNNNNNl^WNNNNN^ 

__ + + ♦ * ♦ 2597 

???????????????????? 

NNNNNNNNNNNNNNNNTMNl^NNNNN^ 

+ + + + + 2657 

???????????????????? 

NNNMOTINNNNNNNNNNNNNNNNN^^ 

+ + + «• + + 2717 

???????????????????? 
NNNNNNNNNNNNNl^JNNNNN^^ 

„«. + + ♦ + + 2777 

??, ??????????TAAMPPSV 

CTGCCCTCTTGTAACAAGCCTCAAGTGCTCAATGGTGTGAGAAGCCCAGGAG 

+ + + + + + 2837 

LPSCNKPQVLNGVRSPGATA 

G ATGCTCTCTCTTCAGGTCC ATCT AGCC C ATCAAGCC TTTCTGCAGCC AGCAGTCTTG AC 

+ + + + 2897 

DALSSGPSSPSSLSAASSLD 

kACTTATCTGGGAGTTTTTCAGAACTGTCTTCAGTAGTTAGTTCA^TGGAACAGAG^ 

2957 




NLSGSFSELSSVVS SJ S G T E G 

gcttccagtttggagaaaaac/S^gttccag^^ 

__ + + + * + 3017 

asslekk|evpgvdfsitqfv 
aggaatcttggacttgagcacctaatggatatatttnagagagaacagatcact ^^^^ 

rnlglehlmdif? reqitld 
gt attagttg ag atg gggcac aagg agctg aagg ag attggw atc aatgctt at g g ac at 



3137 

VLVEKGHKELKEIGINAYGH 

A 

3197 



V Li v c n v " *- « »» — _ 

AGGC ACAAACT AATTAAAaG t tTC G AGAGACT^^CTX^GAC AAC AAGGTCTTAACC C A 
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R H K L I K if^F ERLISGQQ N P 

TATTTAACTTTGAACACCTCTGGTAGTGGAACAATTCTTATAGATC^^ 
3198 ♦ + + + + 3257 

YLTLNTSGS GTI LZDLSPDD 

AAAGAGTTTCAGTCTGTGGAGGAAGAGATGCAAAGTACAGTTCGAGAGCAC^ 
3258 — + + + ♦ ♦ 3317 

KEFQSVEEEMQSTVREHRDG 

GGTCATG CAGG TGGAATCTTCAAC AGATACAATATTCTC AAGATTCAG AAGGTTTCT 
3318 + + + + + 3377 

GHAGGIFN RYNILKIQKVCN 

AgagccaagattcggcacgaGGAAAGATACACTCACCGGAGAAAAGAAGTTTCTGAAGAA 
3378 — + +— + + + + 3437 

RAKIRHEERYTHRRKEVSEE 

AACCACAACCATGCCAATGAACGAATGCTATTTCATGGGTCTCCTTTTGTG 
3438 + + + + + 3497 

NHNHANERMLFHGSPFVNAI 

ATCCACAAAGGCTTTGATGAAAGGCATGCGTACATAGGTGGTATGTTTGGAGCTGGCATT 
3498 -+ + + + + 3557 

IHKGFDERHAYIGGMFGAGI 

TATTTTGCTGAAAACTCTTCCAAAAGCAATCAATATGTATATGGAATTGGAGGAGGTACT 
3558 t + + + + + 3617 

YFAENSSKSNQYVYGIGGGT 

GGGTGTCCAGTTCACAAAGACAGATCTTGTTACATTTGCCACAGGCAGCTGCTCTTTTGC 
361B „ + + + — + + + 3677 

GCPVHKDRSCYICHRQLLFC 

CGGGTAACCTTGGGAAAGTCTTTCCTGCAGTTCAGTGCAATGAAAATGGCACATTCTCCT 
3678 „ + + + + ~+ + 3737 

RVTLGKSFLQFSAMKMAHSP 

CCAGGTCATCACTCAGTCACTGGTAGGCCCAGTGTAAATGGCCTAGCATTAGCTGAATAT 
3738 -4 + -+ ♦ + * 3797 

PGHHSVTGRPSVNGLALAEY 

GTT ATTT ACAG AGGAG AAC AGGCTT ATC CTG AG T ATTT AATT AC TT AC C AG ATT ATG AG G 
3798 -+ +— -+ + + - 3857 

VIYRGEQAYPEYLITYQIMR - 

CCTGAAGGTATGGTCGATGGATAAATAGTTATTTTAAGAAACrAATTCCACrc 
3858 + ♦ — + ♦ ♦ - 391 7 

PEGMVDG* IVILRN*FH*T* 

AATCATCAAAGCAGCAGTGGCCICTACGTTTTACTCCTTTC 
3918 __ + + + + ~+ + 3977 
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NHQSSSGLYVLLLC* KKIXL 

CCCACAGGCCTGTGGCAAAAGGATAAAAATGTGAACGAAGTTTAACATTCTGACTTGATA 
3978 — + ♦ + + ♦ * C037 

P T G LWQKDKNVNEV * H S D L X 

AAGCTTTAATAATGTACAGTGTTTTCTAAATATTTCCTGTTTTTO 
4038 — — + * 4 097 

KL**CTVFSKYFLFFQHFNR 

TGCCATTCCAGGTJAAACTGGGTTGTCTGTACTAAATTATAAACAGAGTTA^^ 
4098 «■ — ~ ♦ — * + 4157 

CHSRLMWVVCTKL*TELT*T 

TTTTATATGTTATGCATTGATTCTAACAAACTGTA^ 
41SB — + + — + 4217 

FYMLCIDSNKL*CPQQN* FY - 

T AAT AC AAT ACTGTGTTCTTTAAAACAC AG C ATTT AC ACTG AAT AC AATTT C ATTT G T AA 
4218 ♦ * * - * 4277 

* YNTVFFKTQHLH * IQFHL* 

AACTGTAAATAAGAG CTTTTGTACTAG C C C AGT ATTT ATTT AC ATTG CTTTGT AAT AT AA 

4278 «■ + + + + 4337 

NCK*ELLY*PSIYLHCFVI* 

ATCTGTTTTAGAACTCCAAAAAAAAAAAAAAAAAATC 

433B -+ * * 4374 

ICFRTAKKKKKN 
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GQGAGGAGOG CCCTTGCCAG CTTCOOCCOC CGCGTCGTTT CAQGACCCQG ACOGCOGATT CGCCCTQXT CC GC C OC C CC O G GG C ASCCG GGOGGCA GGG 103 

ACOCCRT0GA GGOGCOCQCG TO OO COC O CC CATGGGACTG OGCCGGMtC GCTGACAGCA GGGACCCAAG C GSCCCGGGC CCTGaCCCCC rCTTCTCCGG 200 

GOOOCCTOGC CCTCCTCCTC GOGGGGCCGG GGCTCCTGCT CCOGTTGCTG COCCTCTTOC TGGCTGTGGC GGCGCCCAGC ATCATGTCGG GTCGCC GC TO 300 

MSG R R C 



- - - - 400 

A G G Q A A C ASA A A E A V E F A A B ELF E A C X W G D VER 

CTCAASAGSC T3GTGACGCC TCAGAAGGTG AACAGCOQCQ ACACGSCGGO CACGAAATCC ACCCCOCTGC ACTTCOCCOC A OOTTTTOO G CGGAAAGACC 500 
V R R L V T P EKV NSRD TAG RKS T P L H P A A G F C RKOV 

TAGTTSAATA TTTGCTTCAG AA7QGTGCAA ATCTCC A A3C ACCTGATGAT GOOQGCCTT A TTCCTCTTCA TAATGCATGC T Vir T TGUTC A7 GC T GA A5T 600 
VET L L 0 MGAS V 0 A RPD O O L I PLK MAC SPGH A E V 

AOTCAATCTC CTTTTGCGAC ATGGTCCAGA CCCCAATGCT CGAGATAATT CGAATTATAC TCCTCTCCAT GAAGCTCCAA TTAAAGGAAA GATTGATG7T 700 
V IJ h L L R H GAD P K A RDHW HYT PLH B A A I KGX XDV 

TGCATTCTGC TGTTACAGCA TQGAGCTGAG CCAACCATCC GAAATACAGA TGGAAGGACA GCATTCGATT TAGCAGATCC ATCTGCCAAA OCAQT GC TTA BOO 
CIVL LQH OAE PT1R MTD CRT ALDL A O P S A K A V L 7 

CTGCTCAATA TAACAAAGAT CAACTCTTAG AAAGTGCCAC GAGTGGCAAT GAAGAAAAAA TGATGCCTCT ACTCACACCA 7TAAATOTCA ACTOCCACGe 900 
GET K X D ELLE SAR SGN EERK HAL LTP L H V H CHA 

AAGTGATGGC AGAAAGTCAA CTCCATTACA TTTGGCAGCA G3ATATAACA GAGTAAAGAT TCTACAGCTG TTACTGCAAC ATQGACCTCA TGTCCATCCT 1000 
SDG RRST PLH L A A GTHR VRI VOL L L Q H GAD V H A 

AAAGATAAAG GTGATCTGGT ACCATTACAC AATOCCTGTT CTTATOGTCA TTATCAASTA ACTGAACTTT TGGTCAAGCA TGGT CC CT G T GTAAATGCAA 1100 
X D K G DLV PLH MACS TGH YEV TELL V X K GAC VWAM 

TGGACTTGTG GCAATTCACT CCTCTTCATG AGOCAGCTTC TAAGAACAGG CTTGAAGTAT Gm.TL - y.Tl T CTTAACTTAT CCTGCAGACC CAACACTGCT 1200 
DLM OFT PLHE A A S X N R VEVC SLL L S Y GADP T L L 

CAATTGTCAC AATAAAAGTG CTATASACTT COCTCCCACA CCACACTTAA AAGAAAGATT AGCATATGAA TT7AAAGGCC ACTCGTTGCT GCAAGC7CCA 1300 
NCH NKSA IDL APT POLK ERL ATE FKGB SLL 0 * * 

CGACAACCTC ATGTTACTCG AATCAAAAAA CATCTCTCTC TQGAAATQCT GAATTTCAAG CATCCTCAAA CACATGAAAC AOCATTGCAT TCTOCTGCTG 1400 
BEAD VTR IKK HLSL EMV KFK HPOT HET A L H C A A A 

CATCTCCATA TCOCAAAAGA AAGCAAATAT GTCAACTGTT GCTAAGAAAA GGAGCAAATA TCAATGAAAA GACT.AAA3AA TTCTTGACTC CTCTGCACG7 1S00 
SPT PKR KQIC ELL L R K G A N I NEK TKE FLTP 1> H V 

GGCATCTGAG AAAGCTCATA ATGATGTTGT 7SAAGTAGTG GTGAAACATG AAOCAAAGGT TAATOCTCTG GA7AATCTTC GTCRGACTTC TCTACACAGA 160O 
ASE X A H N DVV EVV VKHE A K V NAL DNLC 0 ? S LHR 

GCTCCATATT CTGGTCATCT ACAAACCTGC CGCCTACTCC TGAGCTATGG GTGTGATCCT AACATTATAT CCCTTCA3QG CTTTACTGCT TTACAGATGG 17 00 
AATC GHL QTC RLLL SYC CDP MIIS LQG FTA LQMG 
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GAAATCAAAA TGTACAGCAA CTCCTCCAAC AGGC7ATCTC ATTAOCTAAT TCAGACGCAG ACAGACAATT GCTGGAAGCT OCAAAOCCTG GAGATCTCCA 1850 
BED V00 L L O E CIS L G M S E A D R 0 L LEA A X A C DVE 

AACTOTAAAA AAACTGTSTA CTOTTCA CAO TG7CAACTGC AGAGACATTG AAGOGCOTCA CTCTACACCA CTTCATTT7G CAGCTOCGTA TAACXCACTC 1900 
T V X K L C T V 0 «S V H C R O X E G R 0 S T P L H F A ACY H R V 

T CO 37G 0 T0Q AATATCT3CT ACAGCATOGA CCT GA TPPGC ATGCTAAAGA TAAAGGAGGC CTTG f T A CCTT TGCACAATGC ATGTTCTTAT GGACATTATG 20 00 
SVVE Y L L 0 H G A D V H A K D R 0 Q L V P L BNA C S Y CHYE 

AACTTGCAGA ACTTCTTCTT AAACATGGAG CA3TAGTTAA TCTAGCTGAT TTATGGAAAT TTACACCTTT ACATGAAGCA GCAGCAAAAO GAAAATATGA 2100 

V A E LLV XHOA V V » V A D L W X F T P L BEA A A R G KYE 

AATTTOCAAA LI ICTOCTOC AOCATOCTGC AGACCTTACA AAAAAAAACA GCGATGGAAA tACTCCTTTO GATCTTGTTA AAGATGQACA TACAGATATT 2200 
ICR LLLO H OA DPT X X N R DON TPL DLVX D G D T D X 

CAAGATCTGC TTAOOOGAGA TGCAGCTTTG CTAGATOCTG OCAACAAOGO TT G TTT A GCC AGAGTCAAGA ACTTGTCTTC TCCTGATAAT CTAAATTGCC 2300 
ODLL ROD A A L LDAA RKC CLA XVKX L S S PDN V M C ft 

O0GA7ACCCA AOGCAGACAT TCAACACCTT TACATTTAGC ACCTCGTTAT AATAATTTAG AACTTGCAGA GTATTTCTTA CAACACGGAG CTGATCTGAA 2*00 
DTO Q R H STPL 8 L A AOY K H L E VAC YLL O B G A DVD 

TGCCCAAGAC AAADGAGGAC rTATTOCTT I ACATAATCCA CCATCTTACC OGCATCTASA TGTAGCACCT CTACTAATAA AGTATAA70C ATGTGTCAAT 2500 
AQD RCOL IPL BNA ASYO BVD V A A LL2K T H A CVN 

GC CA OQ GA CA AATGGGCTTT CACACCTTPG CACGAAGCAG CCCAAAASGG ACGAACACAC CTTTOTGCn TGTTGCTAGC CCATCCAGCT GACCCGACTC 2600 
ATDR NAT TPL H E A A QXG R 7 Q LCAL L 1. A HGA DPTL 

T7AAAAATCA GCAAOGACAA ACACCTTTAG ATTT AO TTT C AGCGCATGAT GTC A GCCCTC TTCT GA CAOC AOCCATGCCC CCATCTGCTC TOCCCTCTPG 2700 
KNQ EGQ TPLO LVS ADD VSAL L T A AMP P5AL PSC 

TTACAAGCCT CAACTGCTCA ATGCTGTGAG AAGCCCAGCA GCCACTOCAa ATGCTCTCTC T7CAGCTCCA TCTASCCCAT CAACCCT7TC TGCAGCCA3C 2800 
TKP QVLN GVR SPG ATAD ALS SGP SSFS S L S AAS 

ACTCTTGACA ACTTATCTOG GAGTTTTTCA GAACTGTCTT CAGTAGTTAG TTCAAGTGGA ACAGAOGGTG CTTCCAGTTT GGAGAAAAAG GAGGTTCCAG 2900 
SLON LSG SF5 ELSE VVS SSG TEGA SSL ERR EVPG 

CACTACATTT TACCATAACT CAATTCGTAA OGAATCTTGG ACTTCAGCAC CTAATOGATA TATTTGAGAG AGAACAGATC ACTTTGGATG TATTAGTTGA 3000 

V D P SIT OPVR NLG LE M LMDI FEX E 0 I TLDV LVE 

GATGGGGCAC AAOGAGCTGA ACGAGATTGC AATCAATGCT TATOGACATA CGCACAAACT AATTAAAGGA GTCGAGAGAC TTATCTCCGQ ACAACAAGGT 3100 
UGH RELR EIO I N A YGHR HKL ZKG VERL ISO Q 0 C 

CTTAACCCAT ATTTAACTTT GAACACCTCT GGTXCTGGAA CAATPCTTAT AGATCTGTCT CCTGATGATA AAGAGTTTCA GTCTGTGGAG GAAGAGATGC 3200 
LNPY LTL NTS GSGT I L I DLS PDDX E F 0 SVE E E H Q 

AAAGTACAGT TCGAGAGCAC AGAGATGCAG GTCATGCAGG TGGAA TC TTC AACAGATACA ATATTCTCAA CATTCAGAAG GTTTGTAACA ACAAACTATG 3300 
STV REH R D G G BAG GIF N R T N ILR I 0 K VCNK RLK 

GCAAAGATAC ACTCACOGGA GAAAASAAGT TTCTGAAGAA AACCACAACC ATCCCAATGA ACGAATGCTA TTTCATOGGT CTCCTTTTGT GAATOCAATT 3400 
ERY THRR REV SEE NHNB ANE RML FBCS PFV N A I 

ATCCACAAAG GCTTTCATCA AAQQCATGCG TACATAGGTG CTATGTTTGG AGCTCGCATT TATTTTGCTG AAAACTCTTC CAAAAGCAAT CAATATGTAT 3500 
IHKC FDE X B A Y I C G MFC ACI YFAE NSS RSN 0 Y V Y 

ATGCAATTGC AGGAGGTACT GGGTGTCCAG TTCACAAAGA CAGATCTTCT TACATTTGCC ACAGGCAGCT GCTCTTTTGC COGGTAACCT TGGGAAAGTC 3600 
GIG CGT GCPV BED X S C YICB R 0 L LFC RVTL GKS 

TTTCCTGCAG TTCA3TGCAA TGAAAATGCC ACATTCTCCT CCACCTCATC ACTCAGTCAC TGGTA3GCCC AGTGTAAATG GCCTA3CATT AGCTGAATAT 3100 
FLO FSAM RNA BSP PGHB SVT CRP SVNG LAL AEY 

GTTATTTACA GAGGAGAACA GGCTTATCCT GAGTATTTAA TTACTTACCA GATTATGAOO CC7GAAGOTA TGCTOGATGG ATAAATAGTT ATTTTAAGAA 3800 
VIYR O E 0. A Y P EYLI TYQ I H R PEGK VDO 

ACTAATTCCA CTGAACCTAA AATCATCAAA GCAGCAGTGG CCTCTAOGTT TTACTCCTTT GCTGAAAAAA AATCATCTTG CCCACACGCC TCTGGCAAAA 3900 



GGATAAAAAT GTGAACGAAG TTTAACATTC TGACTTGATA AASC7TTAAT AATGTACAGT CTTTTCTAAA TATTTCCTCT TTTTTCAOCA CTTTAACAGA 4000 



TCCCATTCCA CCTTAAACTG GCTTGTCTOt AC7AAATTAT AAACAGAGTT AACTTGAAOC TTTTATATOT TATGCATTGA TTCTAACAAA CTGTAATGCC 4100 



CTCAACkCAA CTAATTTTAC TAATACAATA CT G TCTTCTT TAAAACACAG CATTTACACT GAATACAATT TCATTTGTAA AACTGTAAAT AAGAGCTTTT 4200 



GTACTACCCC AGTATTTATT TACATTGCTT TOTAATATAA ATCTGTTTTA GAACTGCAAA AAAAAAAAAA AAAAA 
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Figure 5 
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i iiiiiimmimiit.iiHiiiiiiiiiniiiiiiiiiiiMiiinhiiiiii.iii inmiiihhiii lufiiimiim 

OCLVPUJNACSTOJnrEVTriALrKCACVJttMDUfOr^ 500 
PVT R I XJOtLS LX.M\?i'FK>tPQTH£TALKCAAAS PY FKRKD I C ELXJJUQlAjmiEKTXEr L T Pl^'AKaC^TOTWTVWTrw f-> f^KA I j*?! rS?t ? f .R ftA* T 442 

l^L'JkUi' U;iU IIIM 11 111:111 n ii. mi hi i- in ii 

DLAITOCTIAI^IHFlCQPQSHrrALHCAVAS^ 600 
CGHLQTCRLLLSYGCD PNI IS I£CTTAIiQKGHSf\rQOUO£GISlCNS CADllOIXEAAXACDVCTVXKLCTVQSVXXRDI DC RQ5TPLH FAACYNRVSW 542 

llllllllltlll ILIMMIIM Mill llhl I -hi -till. III). Il||. ||. hllllhlll llllltllllllllli 

AGKUrrCJUil^TCSDPSIISLOCT^AAOKCa^VTOILOTSTPraTC 7D0 

Evu.QBGAoyHAKDKC^wuntACSVCi^^ ^M T PL m/vKiyrrmnorit, £42 

mi 1 n 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 1 : 1 1 1 111 111 inn 1 11 inn 1 mi mi- 111 11 11 1 11 111 111 nihil in in 

EnUWGADVHAKDXOGLVPLKKAC SYGHYEYAElXVRHGASVOTABUfXFTPLHEAAAKGKYEI CKtlXKHGADPTKJOlRTOIfTPlX^VXtcirDI QOL. 800 
LKG OAALLDAAKKGCLARVKKLS 5 FDNVNCRDTOGRKST PLKLAAG YlZNUrVAEYXUJHGAZr/HAQOXGGL I PU{HAASY(9{VDVAALL I RYNACVNATD 742 

initiiiiniiniiii.il .hMiiimi.iiiiiimitiiiiiiiihMiiiiiiiiiimiiiiiiiiiihiimiii imii 

KWkWPIJIEMQJBHXQI£ALLIJlII CADPTLXHOEGOTPLDLVSADDVSALLTAAM PP SAL PS CYK POVUJCVJIS PGATAUXLS SCP SS PSS LSAASSLD 842 

llimillllllllllllllllllllMhlllllllllll -III: III Mil III. hill ..|| hllMIMhl 

KKAFT PLHEAAQKGftTQLCU.L LAHSADPTKKDQ EGQTP U>LATADD I RALLI DAM PPEAL PTCFKFQAT WSASLISPASTPSCLSAASSID 993 

KLSGSF5ELS5WS5SCTEGASSLEXK. .EVPGVDFSriOFVIUIUnXHIiOIFEREQITl.^ 94 0 

h i ii- -i- : i 1 ■ hi 11 1.1 -hih:.iiiin mi 11111111 t n 1 . 1 1 1 1 1 1 n 1 1 n j 1 1 j 1 1 r 1 11 1 1 nn 

M.TCFLAJCLAVG(»5HA<2X»ACTIJWECEVAGL^ 1093 
"FTLTlJ.TSGSGTIUinLSPiroXirOSVI^EKOST^^ j 040 

1 1 1 1 1 • IMhlhhllhlllllllllhllllilhllllllllh:nitl Mil Ih IhlMMIIIII lllllllllihlllt 

WFYT.TraCVNQCTIIXDLAPEDKEYQSvr£!2^STIREHRTCGK^ FFI NAI j 1193 

HKG7DERHAYXGQHFGAG IYP AHJ SSKSKOYVYC I CGGTSC PVHXDRSCYICKRQLLFCKVTLGXS Fl/QFSAXKMAHSPPGHHSVTSRPSVNG LALAE YV 1 1 40 

niimiiiiiiinniiiiiniiiiiinfiiiiini iiiiihiini imniiii iiiihmmi nimiii mi 



iiiiiiiiiiiiinihii 
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101 TcctxeACTCAOcrcTCqcrrGG^^ 



GO 2 

CTAGOGCTG 200 



TTCC OCCOCCCL - - — . , - - - 

1 HI nil j || I 1 1 lit I II III I III 1 II I II I II I I IM I 

201 6cGGM»raT«C*GTCOOa*TCCGCCC^^ 



I 



XX»GOCXCCCGOGOCCCACaCAC 



II I II 



■coGOG XTC CT GC T J cccTr ac Tc ra^ TC i Tr GC T G^ 

I III! I I II I III I II 1 I I I I I I I 

Tl lX ICTC CA TegTCCCCTCGItTCGA UC 1 \\JJLVM XXXXXXiU^^ 

I Ml III II II I I I I ill I II III llll II II III I HI M II HI M IM II Mil II 

mmiiiii ii iff ii n ii ii iiii iii iii nil ii i ii minimi m ilium tin n ' 

S01 TAOTTGJU^TATTTGCTrCAGAATGGTrcAAATCTO 

i inn i m im urn i ii mill ii in mm ii ii imimm iii m mmiim in i 

697 l^ixGAACACT^AaL^^ 

n i mMiMii ii ii ii u in mi inn 1 i m mimi Minimi 

701 TGCATTGTGCTGTTACAI^TGGAG^^ 

miiiiiMit i mil iimm mi in n n n him i nn mi 1 imiin n n urn n • 1 1 r 

B01 CTGCTGAATWAAG***CA^ 

i mmii iiiiiiii linn inn m niimi miMimii i mm mi n 11 mint n him n 

997 CXGGTt^T*CJJUSWU«»CaU«TCCTAOAAOCTGCT 

901 AAG TGATGGCAGAAAGTCAACTCCATTACATTTQGCKGCJLO^ 

Miiinii iimiii inn mm i nut 11 it mimi h in > >' ; i 1 1 1 1 in n m m n m 1 1 n 

1097 AAgTOATGGGCGAAACTta ^rai TrrACATCT^ i CAGCA7CGTGCTGAfrC ITC ATGCA 

1001 AAAt^TAAAOGTCATCT^ACCATTACACAATC ^l 1 Vl 'l' l 1 1 ATOXTrCATTATe^AgTAACTCAA in ' HU, I ' UA AGCATGG'miCTGTCTAXXTOCAA 

nm mn n \H\\\ i ii imm iimi urn immmimi n inn i 1 mil inn tin 

1197 AAAGACAAAGg^ ^ 

rTAA£3TTATGCTCCACACCCAACACTGCT 

i i n mi ii ii mil 

TACTTA2CCXTOCCCCTCATCCTACGTTACT 



1101 TGGACTTCTOGCJUlTreACTCCTCro 

im i inn n inn ii n inn nut mill in inn 11 

1297 TXJGATCTCTGCCACTTITVCTCCACTGCACGAGGCTGCTTC CAAGAACCGTGTAC 
1201 C wmtrTC*CAATAAAAC^AT*y^^ 

iii n ii 1 1 1 1 1 m 1 1 t ii limn M ii nil n inn i imiimiim n mm ii inn n 

l 3 9 7 CWCTCCO.TGGCJUUUkCTG^TXWTATGCCTCCAACTCCCGX^ 

ijOl C GA5AA3CTGATCTT AC 7CGAATtLAAAAAACATCTCTCTC7GGAAAT3G TGAATTTCAAGC ATCC TCAAACACA TCAAA CAGCATTGCAT TGTCCTGCTG 

Tim mii 1 11 1 1 mm m iiiiiimi nnii n n i niiiiiniii im iiiimi 

1497 AaA<^U^ACACTTACCTAAACTTJUJUUUU^<-iX.^ i\.i\J ^ KMTCXTTMTTKMACA^<ttCM*7C7*XTG&MEXGCACTGCttTGTGCTGTGQ 
1401 eATCTCCATATCCCAAAAiC*JUU?CAAATAT^^ 

\ im^ * nrniunui 1 m 1111 11 MiiiMMimi 1 iiiiiiii 1 mm in in mi im ■ ; ,; 

1597 cCTCrCTt^TCCOJJ^CCTAAACAJ^ 
1501 CCCXTCTV&JMSCTCX™ 

in 1 11 1 111 milium i inn n n nil n in 1 iiiii inn i niiiiiniii n t n i" 

1601 GCTCCATATTCrrOOTCATci'ACAAAl.V. IO C CCCC T ACTCCTCAOCTATOGCTOTC^TOCTAACATT ATATCCCTTCAiA<OC '1' 1 1*. i*. U I AC*GAJ0C 

n 11 ti n n 11 minimi n mn 11 n i in n m n m i n mn 11 n iiiiiiii 

1797 CCCCCCCTACCACOXACCTCCAGAC^^ 

1701 CAAATGAAAATGTACAQCAACTCCTCCAAGA3GCTATCTC 

I mm 11 inn i n in in t n in in 11 I n i t i i n 11 i n iiiiiiii i ii 

1697 CCAATCAAGCACIXXJ^AOATX^^ 
1801 AACTCTAAAAAAJICTGTCTACTC^ 

mm 11 im 11 1 m 1 in 11 11 mm 1 n n n n n n n i n n mn n it in t in 



1901 TCCCTIWTW^TATrarrACJ^TQCA^ 

n 11 11 11 11 mimi 11 11 11 mn mn 11111 11 11 m 1 11 n 1 11 mn nnj nnniur' 

2097 TCTClTCT A GACTACCTCXrTACACCACCOrTGCCGATCTCCA* " " * " 



2000 

TAAlTCCTCm^TATGOACACTATC 2196 
2100 
2296 



2001 AAGTTCCJUSAACTTCTIOTTAAA^ 

1 11 11 n m 1 11 1 mn 11 11 mn 1 1 11 iimmmm m i iminmim immi mn 

2197 AGCTCCCTGAGCTTTTACTAAGG 
2101 JLATTTCCAAACTTCTO^Ca^ 2200 
2297 AATCTOCAACC ICC TTT1 AAAACATOGaUSCASATT 



102 
297 
200 
397 
300 
497 
400 
596 
500 
696 
600 
79S 
700 
896 



900 

1096 

1000 

1196 

1100 

1296 

1200 

1396 

1300 

1496 

uoo 

1S96 
1500 
1696 
1600 
1796 
1700 
1896 
1800 
1996 
1900 
2096 



ATAjCJUCCTTTGGATTTGCTAAAGGAADG^GACACAGATATT 2396 
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2201 CAAG A T l T UL T TA COOfflU^TOCA OC IT rGClA S*^ 2300 

ii ii i ii i ii him mhm i iiimimimiM n i n mm inn i i i mi hum h hi 

2391 atfgUCTTAataAACQOa UtX T ULlX T C T l\^ ItyrC»JUVmUWGQCCTQCCT^^ 2496 

2301 GCt»TACCCAA<3CCAGACATTOUCACC^ 2400 

I II Hill IHHI HIIIIMIMII IHIII H 1 1 1 1 1 1 1 1 I Mill II II II I I II I II 11111111111 H 

2497 OUaCMXCJUTCGCAGAAATTCMCC^^ 2596 



2401 TGXCMSSACMJOMX^^ 

mm mi i n n i nun i mimi mum ihiiih n mi in n i tint n n i tmi n 

2591 TCCeCAaSACAAC UyjX^ TITJaTTC C rCn 2696 
2501 GCXXCXSaUZXJJi'lVJLASn'IlJiCJ^ 2600 

n ii n n mil it n it i ii minimi n m t 11 nni unit mil inn n inn n 

2697 GCAACA(»TAA C T« Kre T m CTCCarit XA T^^ 2796 
2601 TT\AAJSftTrMT<iftAffi ft C A AfrCft ' 1 "IT I *r r Lr^^LrjT-rTrtr:rrv^Tr^Tr?rc%cxra m ttttt t< ^r*r^xrrr~*TCX~cvy^\Ti^rx-TC~K m j'T-(~T\~^?c. 2700 

i it ii 1 1 1 1 1 1 1 f ii n hi t 111 it mi ii 111 tin m i im n illinium n i n i n 

2197 TCAAaAACCAflGAACOCCAI^l\,lCra 2B96 
2701 TTACAAGCCTCAAGTCCI^TSGT'OI^^ K IHA GCCAOC 2800 

ti ii iiiii i i ii mil t mi m im i m n mi n 11 mm 

2897 TTTTAAACCTCA . . . GQCTACTGTAGTCAG TOCCTCTCT O ATCTCACCJUXXTCCJ re 2975 

2801 AirrCTTGACAACTTATCTGGGAt, rrrt - ni ACUAC re iCTr CA OTAGTrA(^^ GACG 2B94 

II i mm I till HI llll II I H II IM III H II II I I II I III II I 

2976 ACCATACACAACCTCIUrrOCKXCTrrACCXCau n W -U. 1 AOGAOt^CCCTCCAATCXAOGOSATO^^ 3075 

2895 7TCCAOGAC™»TlTTAa^TAA^^ 2994 

3076 jjjj^GTtlUACAl^^ IJl JLiiUlIli^ 3175 

2995 ACnTGAGATCGGGCACAAOGAGCTOAAQGAIlATTG^ 3094 

i in nm ii i ill tin ii ii ii niiini Hill ii i mm mi mum n inn i nun 

3176 ggctwtatggotcatgaagacttcuaacaa^^ 3275 

3095 C AAgrrCTTAACCgATATTTAACTCTGAACACIL.TC.'ItSG T AOTQGAACAATTCTTATACATCTG rCTCCTSATGATAAA(^GTITCACTCTTITOQAOGAAO 3194 

mil ii ii iiiii inn ii t i nm m i t nm mi n mum t mitt inn im 

3276 CAASGCACCAATCCTTATTTCACTrTTCAC^ 3375 
3195 AjGATGCAAACTACXCTTCGAGAGCACAG^ 3294 

iiimiimii 1 1 1 1 1 1 1 i m 1 1 m it 1 1 in im it n iiiiiiimiimm i i iiiii n m iimm 

3376 AGAT«yUUtCTACTATraAauCAC*GA^ 3475 
*295 ACTATCGCAAAg\TAClLJCTCACCOCW^^ 3394 

i mi mi nm ii iiiii inn ii ii iminii mm n i n i mi iiii mimi i m 

3476 GTrGAGQGACCOCTTCT^ C OCACCGACASAAOGAAOTGTCTCAGGAGAATCACA^ iTCATTAAT 3 575 

3395 GCAATTATrxyKJUUaarnTGATGA^ 3*94 

u nm ii iii in i lint mum ii ijnnn nn 

3 576 GCCATTATTCATAAAGGCTTTGATt^GCGACATKATACATAGGA^ 3675 
3495 ATCTATATCCAATTCCAGGAGGT ACTOOOTtjTCCAGTTCACAAAGACAGAT^ 0CCSTAACCTPC3G 3594 

mi it ii ii ii it n iiiii nm n n u n it nm it miiii n i n m n 

3676 AT<?TWATGGAArrG«UOGAGQAACAQCCTXXCCTACA^ 377 5 

359S AAAGiVrnx X RVCACTTCACTCCAATC^^ 3 "< 

n ii ii iimm it i imiiiim u i mum n muiiui nun 11 n it inn it mi in 

3776 GAAATCCTTTCTGCAGTT^ 3 « 7 * 
3695 GAATATCTTATTTACAGAGGAGAACAGGCTTATCCTGAt^^ 3794 

1 1 1 1 1 1 1 1 ji iiiiiiiiiiHiini n u mm i n mmum mi in im i i n u ^ 

3876 GAATATCTCATCTACACAOGACAACACGCATAOCCACUkGTATCTrATCACTTACCAt^ . . . CCG 3972 

3795 TAACAAACrAATTCCACTGAACCTAAAATCATCAAAG . CACCACTCCCO'CT A CGTTTrA L JU. IVi AAAAAAAATCAT^CTTGCCCACAGOCCTCT 3893 

I llll llll I MM III I II H I III II I H lit I I III II 

3973 CACaGCAaAACACCTASTCAATCCCTCCT GGTO AACC^^ A ATAACAACATCAATAT 4072 

3694 GGCAAAACGATAAAAATTTrCAACGAACTTTAACATTC TGACTTGAT AAA GCTTTAAT AATCTACAU l t,J J 1 1CI AAAT A T nil PCTTTT1 ilA CCACTT 3 993 

i iii iii iii i ii i i min n ii u 

4073 TCTAGAAGTCCCTGACAGCCTA . QAAA7AAU. TUFT TO ICTTC TATAAAGCATTGCTATACTG 4134 
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SSQ. ID NO:S 

GGCAGGAGGG GCCTTGCCAG CTTCCGCCGC CGCGTCGTTT CAGGACCCGG ACGGCGGATT 
CCGCCGCCGC GGGGCAGCCG GGGGGCAGGG AGCCCATCGA GGGGCGCGCG TGGGCGCGGC 
CGCCGGATCC GGTGACAGCA GGGAGCCAAG CGGCCCGGGC CCTGAGCGCG TCTTCTCCGG 
CCTCCTGCTC GCGGGGCCGG GGCTCCTGCT CCGGTTGCTG GCGCTGTTGC TGGCTGTGGC 
ATCATGTCGG GTCGCCGCTG CCCCGGCGGG GGAGCGGCCT GCGCGAGCGC CGCGGCCGAG 
CGGCCGCCCG AGAGCTGTTC GAGGCGTGCC GCAACGGGGA CGTCGAACGA GTCAAGAGGC 
TGAGAAGGTG AACAGCCGCG ACACGGCGGG CAGGAAArCC ACCCCGCTGC ACTTCGCCGC 
CGGAAAGACG TAGTTGAATA TTTGCTTCAG AATGGTGCAA ATGTCCAAGC ACGTGATGAT 
TTCCTCTTCA TAATGCATGC TCTTTTGGTC ATGCTGAAGT AGTCAATCTC CTTTTGCGAC 
CCCCAATGCT CGAGATAATT GGAATTATAC TCCTCTCCAT GAAGCTGCAA TTAAAGGAAA 
TGCATTGTGC TGTTACAGCA TGGAGCTGAG CCAACCATCC GAAATACAGA TGGAAGGACA 
TAGCAGATCC ATCTGCCAAA GCAGTGCTTA CTGGTGAATA TAAGAAAGAT GAACTCTTAG 
GAGTGGCAAT GAACAAAAAA TGATGGCTCT ACTCACACCA TTAAATGTCA ACTGCCACGC 
AGAAAGTCAA CTCCATTACA TTTGGCAGCA GGATATAACA GAGTAAAGAT TGTACAGCTG 
ATGGAGCTGA TGTCCATCCT AAAGATAAAG GTGATCTGGT ACCATTACAC AATGCCTGTT 
TTATGAAGTA ACTGAACTTT TCGTCAAGCA TGGTGCCTGT GTAAATGCAA TGGACTTGTG 
CCTCTTCATG AGGCAGCTTC TAAGAACAGG GTTGAAGTAT CTTCTCTTCT CTTAACTTAT 
CAACACTGCT CAATTGTCAC AATAAAAGTC CTATACACTT GGCTCCCACA CCACAGTTAA 
AGCATATGAA TTTAAAGGCC ACTCGTTGCT GCAAGCTGCA CGAGAAGCTC ATGTTACTCG 
CATCTCTCTC TGGAAATGGT GAATTTCAAG CATCCTCAAA CACATGAAAC AGCATTGCAT 
CATCTCCATA TCCCAAAAGA AAGCAAATAT GTGAACTGTT GCTAAGAAAA GGAGCAAACA 
GACTAAAGAA TTCTTGACTC CTCTGCACGT GGCATCTGAG AAAGCTCATA ATGATGTTGT 
GTGAAACATG AAGCAAAGGT TAATGCTCTG CATAATCTTG GTCAGACTTC TCTACACAGA 
GTGGTCATCT ACAAACCTGC CGCCTACTCC TGAGCTATGG GTGTGATCCT AACATTATAT 
CTTTACTGCT TTACAGATGG GAAATGAAAA TGTACAGCAA CTCCTCCAAG AGGGTATCTC 
TCAGAGGCAG ACAGACAATT GCTGGAAGCT GCAAAGGCTG GAGATGTCGA AACTGTAAAA 
CTGTTCAGAG TGTCAACTGC AGAGACATTG AAGGGCGTCA GTCTACACCA CTTCATTTTG 
TAACAGAGTG TCCGTCGTGG AATATCTGCT ACAGCATGGA GCTGATGTGC ATGCTAAAGA 
CTTGTACCTT TGCACAATGC ATGTTCTTAT GGACATTATG AAGTTGCAGA ACTTCTTGTT 
CAGTAGTTAA TGTAGCTGAT TTATGGAAAT TTACACCTTT ACATGAAGCA GCAGCAAAAG 
AATTTGCAAA CTTCTGCTCC AGCATGGTGC AGACCCTACA AAAAAAAACA GGGATGGAAA 
GATCTTGTTA AAGATGGAGA TACAGATATT CAAGATCTGC TTAGGGGAGA TGCAGCTTTG 
CCAAGAAGGG TTGTTTAGCC AGAGTGAAGA AGTTGTCTTC TCCTGATAAT GTAAATTGCC 
AGGCAGACAT TCAACACCTT TACATTTAGC AGCTGGTTAT AATAATTTAG AAGTTGCAGA 
CAACACGGAG CTGATGTGAA TGCCCAAGAC AAAGGAGGAC TTATTCCTTT ACATAATGCA 
GGCATGTAGA TGTAGCAGCT CTACTAATAA AGTATAATGC ATGTGTCAAT GCCACGGACA 
CACACCTTTG CACGAAGCAG CCCAAAAGGG ACGAACACAG CTTTGTGCTT TGTTGCTAGC 
GACCCGACTC TTAAAAATCA GGAAGGACAA ACACCTTTAG ATTTAGTTTC AGCGGATGAT 
TTCTGACAGC AGCCATGCCC CCATCTGCTC TGCCCTCTTG TTACAAGCCT CAAGTGCTCA 
AAGCCCAGGA GCCACTGCAG ATGCTCTCTC TTCAGGTCCA TCTAGCCCAT CAAGCCTTTC 
AGTCTTGACA ACTTATCTGG GAGTTTTTCA GAACTGTCTT CAGTAGTTAG TTCAAGTGGA 
CTTCCAGTTT GGAGAAAAAG GAGGTTCCAG GAGTAGATTT TAGCATAACT CAATTCGTAA 
ACTTGAGCAC CTAA TGGATA TATTTGAGAG AGAACAGATC ACTTTGGATG TATTAGTTGA 
AAGGAGCTGA AGGAGATTGG AATCAATGCT TATGGACATA GGCACAAACT AAT TAAAGG A 
TTATCTCCGG ACAACAAGGT CTTAACCCAT ATTTAACTTT GAACACCTCT GGTAGTGGAA 
AGATCTGTCT CCTGATGATA AAGAGTTTCA GTCTGTGGAG GAAGAGATGC AAAGTACAGT 
AGAGATGGAG GTCATGCAGG TGGAATCTTC AACAGATACA ATATTCTCAA GATTCAGAAG 
AGAAACTATG GGAAAGATAC ACTCACCGGA GAAAAGAAGT TTCTGAAGAA AACCACAACC 
ACGAATGCTA TTTCATGGGT CTCCTTTTGT GAATGCAATT ATCCACAAAG GCTTTGATGA 
TACATAGGTG GTATGTTTGG AGCTGGCATT TATTTTGCTG AAAACTCTTC CAAAAGCAAT 
ATGGAATTGG AGGAGGTACT GGGTGTCCAG TTCACAAAGA CAGATCTTGT TACATTTGCC 
GCTCTTTTGC CGGGTAACCT TGGGAAAGTC TTTCCTGCAG TTCAGTGCAA TGAAAATGGC 
CCAGGTCATC ACTCAGTCAC TGGTAGGCCC AGTGTAAATG GCCTAGCATT AGCTGAATAT 
GAGGAGAACA GGCTTATCCT GAGTATTTAA TTACTTACCA GATTATGAGG CCTGAAGGTA 
ATAAATAGTT ATTTTAAGAA ACTAATTCCA CTGAACCTAA AATCATCAAA GCAGCAGTGG 
TTACTCCTTT GCTGAAAAAA AATCATCTTG CCCACAGGCC TGTGGCAAAA GGATAAAAAT 
TTTAACATTC TGACTTGATA AAGCTTTAAT AATGTACAGT GTTTTCTAAA TATTTCCTGT 
CTTTAACAGA TGCCATTCCA GGTTAAACTG GGTTGTCTGT ACTAAATTAT AAACAGAGTT 
TTTTATATGT TATGCATTGA TTCTAACAAA CTCTAATCCC CTCAACAGAA CTAATTTTAC 
CTGTGTTCTT TAAAACACAG CATTTACACT GAATACAATT TCATTTGTAA AACTGTAAAT 
GTACTAGCCC AGTATTTATT TACATTGCTT TGTAATATAA ATCTGTTTTA GAACTGCAAA 



CGCGCTGCCT 
CATGGGACTG 
GGGGCCTCGC 
GGCGGCCAGG 
GCCGTGGAGC 
TGGTGACGCC 
AGGTTTTGGG 
GGGGGCCTTA 
ATGGTGCAGA 
GATTGATGTT 
GCATTGGATT 
AAAGTGCCAG 
AAGTGATGGC 
TTACTGCAAC 
CTTATGGTCA 
G C AATTC ACT 
GGTGCAGACC 
AAGAAAGATT 
AATCAAAAAA 
TGTGCTGCTG 
TCAATGAAAA 
TGAAGTAGTG 
GCTGCATATT 
CCCTTCAGGG 
ATTAGGTAAT 
AAACTGTGTA 
CAGCTGGGTA 
TAAAGGAGGC 
AAACATGGAG • 
GAAAATATGA 
TACTCCTTTG 
CTAGATGCTG 
GCGATACCCA 
GTATTTGTTA 
GCATCTTACG 
AATGGGCTTT 
CCATGGAGCT 
GTCAGCGCTC 
ATGGTGTGAG 
TGCAGCCAGC 
ACAGAGGGTG 
GGAATCTTGG 
GATGGGGCAC 
GTCGAGAGAC 
CAATTCTTAT 
TCGAGAGCAC 
GTTTG TAACA 
ATGCCAATGA 
AACGCATGCG 
CAATATGTAT 
ACAGGCAGCT 
ACATTCTCCT 
GTTATTTACA 
TGGTCGATGG 
CCTCTACGTT 
GTGAACGAAG 
TTTTTCAGCA 
AACTTGAACC 
TAATACAATA 
AAGAGCTTTT 
AAAAA 



SEQ. ID NO: 6 



SRDTAGRKST 
FGHAEWNIX 
TTROTDGRTA 
NVNCHASDGR 
ACSYGHYBVT 
ADPTLLNCHN 



AACASAAAEA 
PLH7AAGFGR 
LRKGADPNAR 
LDLADPSAKA 
KSTPLHLAAG 
ELLVKHGACV 
KSAIDLAPTP 



VEPAARELFE 
KDWBYLI/3N 
DNWNYTPLHE 
VLTGEYKKDE 
YNRVKIVQLL 
NAMDLWQFTP 
QLKERLAYEP 



ACRNGDVERV 
GANVOARDDG 
AAIKGKIDVC 
LLE5ARSGNB 
LQHGADVHAK 
LKEAASKNRV 
KGHSLLQAAR 



KRLVTPEKVN 
GLIPLHNACS 
IVLLQHGAEP 
EKMMALLTPL 
DKGDLVPLHN 
EVCSLLLSYG 
EADVTRIKKH 



SO 
100 
150 
200 
250 
300 
3S0 
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LSLENVNFKH PQTOBTAUtC AAASPYPKRK QICELLLRKG ANINEXTKEF 400 

LTPLHVAS2K AHNDWEVW KHEAKVNALD NLGQTSLHRA AYCGKLQTCR 450 

LLLSYGCDFN IISLQGFTAL QKGNENVQQL LQEGISLGNS EADRQLLEAA 500 

XAGDVBTVXK LCTVQSVKCR DIBGRQSTPL HFAAGYKRVS WEYLLQHGA 550 

DVHAXDXGGL VPLHNACSYG HYEVAELLVX HGAWNVADL KKFTPLHEAA 600 

AXGKYEICKL LLQHGADPTK KNRDGNTPLD LVKDGDTDIQ DLLRGDAALL 650 

DAAKXGCLAR VKKLSSPDNV NCRDTQGRHS TPLHIAAGYN NLEVABYLLQ 700 

HGADVNAQDX GGLIPLHNAA SYGHVDVAAL LIXYNACVNA TDKWAFTPLH 750 

EAAQKGRTQL CALLLAKGAD PTLXNQEGQT PLDLVSADDV SALLTAAMPP 800 

SALPSCYKPO VLNGVRSPGA TADALSSGPS SPSSLSAASS LONLSGSFSE 850 

LSSWSSSGT EGAS SLOCKS VPGVDFSITQ FVRKLGliEML MDIFEREQIT 900 

LDVLVEMGHK BLKEIGINAY GHRHKLIKGV ERLISGQQGL NPYLTLNTSG 950 

SGTILIDLSP DDKKPQSVEK EMQSTVREHR DGGHAGGIFN RYNILKIQKV 1000 

CNKKLWERYT HRRKEVSEEN KNHANERMLF KGSPPVNAII HKGFDERHAY 1050 

IGGMFGAGIY FAENSSKSNQ YVYGIGGGTG CPVHKDRSCY ICHRQLLPCR 1100 

VTLGXSFLQF SAMKMAHSPP GHHSVTGRPS VNGLALAEYV IYRGEQAYPE 1150 

YLITYQIMRP EGKVDG 1166 

SEQ. ID K0:7 (Tankyrase I DNA sequence, GenBank Accession No. AF0B2556) 

1 cgaagatggc ggcgtcgcgt cgctctcagc atcatcacca ccatcatcaa caacagctcc 
61 agcccgcccc aggggcttca gcgccgccgc cgccacctcc tcccccactc agccctggcc 
121 tggccccggg gaccacccca gcctctccca cggccagcgg cctggccccc ttcgcctccc 
181 cgcggcacgg cccagcgctg ccggaggggg atggcagtcg ggatccgccc gacaggcccc 
241 gatccccgga cccggttgac ggtaccagct gtegcagtac caccagcaca atctgtaccg 
301 tcgccgccgc tcccgtggtc ccagcggttt ctacttcatc tgccgctggg gtcgctccca 
361 acccagccgg cagtggcagt aacaattcac cgccgtcctc ttcttccccg acttcctcct 
421 catcttcctc tccatcctcc cctggatcga gcctggcgga gagccccgag gcggccggag 
481 ttagcagcac agcaccactg gggcctgggg cagcaggacc tgggacaggg gtcccagcag 
541 tgagcggggc cctacgggaa ccgctggagg cctgtcgcaa tggggaegtg tcccgggtaa 
601 agaggctggt ggacgcggca aacgtaaatg caaaggacat ggccggccgg aagtcttctc 
661 ccctgcactt cgctgcaggt ttcggaagga aggatgttgt agaacaccta ctacagatgg 
721 gtgctaatgt ccacgctcgt gatgatggag gtctcatccc gcttcataat gcctgttctt 
781 ttggccatgc tgaggttgtg agtctgttat tgtgccaagg agctgatcca aacgccaggg 
841 ataactggaa ctatacacct ctgcatgaag ctgctattaa agggaagatc gatgrgtgca 
901 ttgtgctgct gcagcacgga gctgacccaa acattcggaa cactgatggg aaatcagccc 
961 tggacctggc agatccttca gcaaaagctg tccttacagg tgaatacaag aaagacgaac 
1021 tcctagaagc tgctaggagt ggtaatgaag aaaaactaat ggctttactg actcctctaa 
1081 atgtgaattg ccatgcaagt gatgggcgaa agtcgactcc tttacatcta gcagcgggct 
1141 acaacagagt tcgaatagtt cagcttcctc ttcagcatgg tgctgatgtt catgcaaaag 
1201 acaaaggtgg acttgtgcct cttcataatg catgttcata tggacattat gaagtcacag 
1261 aactgctact aaagcatgga gcttgtgtta atgccatgga tctctggcag tttactccac 
1321 tgcacgaggc tgcctccaag aaccgtgcag aagtctgctc tttgttactt agccatggcg 
1381 ctgatcctac gbtagtcaac tgccatggca aaagtgctgt ggatatggct ccaactccgg 
1441 agcttaggga gagattgact tatgaattta aaggtcattc tttactacaa gcagccagag 
1501 aagcagactt agctaaagtc aaaaaaacac tcgctctgga aatcattaat ttcaaacaac 
1561 cgcagrtctca tgaaacagca ctgcactgtg ctgtggcctc tctgcatccc aaacgtaaac 
1621 aagtgacaga attgttactt agaaaaggag caaatgttaa tgaaaaaaat aaagatttca 
1681 tgactcccct gcatgttgca gccgaaagag cccataatga tgtcatggaa gttctgcata 
1741 agcatggcgc caagatgaat gcactggaca cccttggtca gactgctttg catagagccg 
1801 ccccagcagg ccacctgcag acctgccgcc tcctgctgag ttacggctct gacccctcca 
1861 tcatctcctt acaaggcttc acagcagcac agatgggcaa tgaagcagtg cagcagattc 
1921 tgagtgagag tacacctata cgtacttctg atgttgatta tcgactccta gaggcatcta 
1981 aagctggaga cctggaaact gtgaagcaac tttgcagctc tcaaaatgtg aattgtagag 
2041 acttagaggg ccggcattcc acgcccttac acttcgcagc aggctacaac cgcgtgtctg 
2101 ttgtagagta cccgctacac cacggtgccg atgtccatgc caaagacaag ggtggcttgg 
2161 tgccccttca taatgcctgt tcatatggac actatgaggt ggctgagctt ttagtaaggc 
2221 atggggcttc tgtcaatgtg gcggacttat ggaaatttac ccctctccat gaagcagcag 
2281 ctaaaggaaa gtatgaaatc tgcaagctcc ttttaaaaca tggagcagat ccaactaaaa 
2341 agaacagaga tggaaataca cctttggatt tggtaaagga aggagacaca gatattcagg 
2401 acttactgaa aggggatgct gctttgttgg atgctgccaa gaagggctgc ctggcaagag 
2461 tgcagaagct ctgtacccca gagaatatca actgcagaga cacccagggc agaaattcaa 
2521 cccctctgca cctggcagca ggctataata acctggaagt agctgaatat cttctagagc 
2581 atggagctga tgttaatgcc caggacaagg gtggtttaat tec tct teat aatgeggcat 
2641 cttatgggca tgttgacata geggctttat tgataaaata caacacgtgt gtaaatgcaa 
2701 cagataagtg ggcgtttact cccctccatg aagcagccca gaaaggaagg aegcagctgt 
2761 gcgccctccc. cctagcgcat ggtgcagacc ccaccatgaa gaaccaggaa ggccagacgc 
2821 ctctggatct ggcaacagct gacgatatca gagccctgcc gaeagatgee acgcccccag 
2881 aggccttacc tacctgtttt aaacctcagg ctactgtagt gagtgectet ctgatctcac 
2941 cagcatccac cccctcctgc ctctcggctg ccagcagcat agacaacctc actggccctt 
3001 tagcagagtt ggccgtagga ggagcctcca aegcagggga tggcgccgcg ggaacagaaa 
3061 ggaaggaagg agaagttget ggtcttgaca tgaacatcag ccaatttcca aaaagccttg 
3121 gecttgaaca ecttegggat atctttgaaa cagaacagat tacactagat gtgttggctg 
3181 atatgggtca tgaagagttg aaagaaatag geatcaatge atatgggcac cgccacaaat 
3241 taaccaaagg agcagaaaga ctcttaggtg gacaacaagg caccaatcct tattcgactt 
3301 ttcactgtgt taatcaggga aegattttge tggaccttgc tccagaagac aaagaacatc 
3361 agtcagtgga agaagagatg caaagtacta ctegagaaca cagagatggt ggtaatgctg 



2 



WO 00/6,813 PCT/USOO/09558 

llll SESS^ caacagatac aatgtcattc gaattcaaaa agttgtcaac aagaagttga 
lil, f™*? 9 ^ ctgccaccga cagaaggaag tgtctgagga gaatcacaac catcacaatg 
3541 agcgcatgtt gtctcatggt tctcctttca ttaatgccat tattcacaaa gggtttgatS 
3601 agcgacatgc atacatagga ggaatgtttg gggccgggat ttattttgct gaaaactcct 
3661 caaaaagcaa ccaatatgtc tatggaattg gaggaggaac aggctgccct acacacalgg 
llll ac ??f* at9 ctatatatgt cacagacaaa tgctcttctg tagagtgacc cttg^S? 
llll " fc " Ct « Ca Stttagcacc atgaaaatgg cccacgcgcc tccagggcac cactSgtca 
3841 ttggtagacc gagcgtcaat gggctggcat atgctgaata tgteitctac agag^ga^c 

3961 tlltTrUt !r gtatCtt atcacttacc ^tcatgaa gccagaagcc c^Scaga 
3961 ecgcaacagc cgcagagcag aagacctagt gaatgcctgc tggtgaaggc cagatcagat 
4021 ttcaacctgg gactggatta cagaggattg tttetaataa caacatcilt attctagCg 
4081 tccctgacag cctagaaata agctgtttgt cttctataaa gcattgctat agtg 

SEQ. ID NO: 8 (Tankyrase I protein sequence, GenBank Accession No. AF0825S6) 

MAASRRSQHHHHHHQQQLQPAPGASAPPPPPPPPLSPGLAPGTT 
PASPTASGIJVPFASPRKGLALPEGDGSRDPPDRPRSPDPWGTSCCSTTSTICTVAAA 
PVVPAVSTSSAAGVAPNPACSGSNNSPSSSSSPTSSSSSSPSSPGSSLAESPEAAGVS 
STAPI/5PGAAGFGTGWAVSGAUlEIJJ2AaU^^ 

PLJffAAGFXSRKDVVEHLLQljGANVHARDDGGLI PLHNACSFGHABWSLLLCQGAOPN 
ARDNKNYTPLHEAAIXGKI DVCI VLLQHGADPNI RNTDGKSALDLADPSAKAVLTGEY 
KKDELLEAARSGNEEKLMAIiTPIJJWCHASI^^ 
ADVHAJCIJKGGLWIJDiACSYGHYEVTEliLXHGA^ 

CSL^SHGADPTLVNCHGKSAVDMAPTPELRBRLTYEFKGHSLLOAARZADIJUCV^ 
IJILEIINPXQPQSHCTALHCAVASUIPKRJCQVTEI^ 

ERAHNDVMEVimHGAKMNALDTLGOTAI^RAAI^^ j SLQG 

FTAAOMGNEAVOOILSBSTPIRTSr^YRIXEASKAGDLETVKQLCSSOhTWCRDLEG 

RHSTPLKFAAGY>mvSWEYI^HHGADVHAKDKGGLVPLHNACSYGHYEVA£I^VRHG 

ASVNVADLVmFTPUtEAAAKGKYEICXLUJCHGADPT^ 

DI^CDAAXUJAAKKGCTARVQKLCTPENINCRDTCC^ 

LEHGADVNAQDKGCLI PLHNAASYGHVDIAAI^IRYNTCVNATDKWAFTPLHEAAQKG 
RT^LCAUJJ^GADPTMKNQEGOTPI^LATADDIRAIXIDAMPPEALPTCFK PQATW 
SASLI S PA STPSCLSAASS IDNLTGPLABLAVGGASNAGDGAAGTERKEGBVAGLDMN 
I SQFLKSLG LiEHLRDIFETEQITLDVLADMGHEELKEIG INAYGHRHXLI KGVERLLG 
GQ^TNPYLTFHCWQGTILLDLAPEDKEYQSVEE 

VIRIQXVVNEXLRERFCHRQKEVSEENHNHHNERMLFKG SPFINAI I HKGFDERHAY1 
GGMFliAGIYFA^SSKSNQYVYGIGGGTGCPTHKDRSCYICHROMLFCRVTIXjKSFLQ 
FSTOKNA^PGHHSVIGRPSVNGLAYAEYVIYRGEQAYPEYLITO 

SEQ. ID NO: 9 (ESP. GenBank Accession No. AA244138) 

1 tcgacagaca attgctggaa gctgcaaagg ctggagatgt cgaaactgta aaaaaactgt 
61 gtactgttca gagtgtcaac tgcagagaca ttgaagggcg tcagtctaca ccacttcatt 
121 ttgcagctgg gtataacaga gtgtccgtgg tggaatatct gctacagcat ggagctgatg 
181 tgcatgctaa agataaagga ggccttgtac ctttgcacaa tgcatgttct tatggacatt 
241 atgaagttgc agaacttctt gttaaacatg gagcagtagt taatgtagct gatttatgga 
301 aatttacacc tttacatgaa gcagcagcaa aaggaaaata tgaaatttgc aaacctctgc 
361 tccagcatgg tgcagaccct acaa ww-w 

SEQ. ID NO: 10 (ESP. GenBank Accession No. AW157349) 



1 
61 



tttttttaac tgtggtgtgg gagccaagtc tatagcactt ttattgtgac aattgagcag 
tgttgggtct gcaccataac ttaagagaag agaacatact Ccaaccctgt tcttagaagc 
121 tgcctcatga agaggagtga attgccacaa gtccattgca tttacacagg caccatgctt 
181 gaccaaaagt tcagttactt cataatgacc ataagaacag gcattgtgta atggtaccag 
241 atcaccttta tctttagcat ggacatcagc tccatgttgc agtaacagct gtacaatctt 
301 tactctgtta tatcctgctg ccaaatgtaa tggagttgac tttctgccat cacttgcgtg 
361 gcagctgaca tttaatggtg tgagtagagc catcattttt tctccattgc cactcctacc 
421 actttctaag agttcatctt tcttatattc accagtaagc actgctttgg cagatggatc 
481 tgctaaatcc aatgctgtcc ttccatctgt atttcggatg 
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